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Abstract Observations of climate change during the CMIP5 extended historical period (1850-2012) 
are compared to trends simulated by six versions of the NASA Goddard Institute for Space Studies Mod- 
elE2 Earth System Model. The six models are constructed from three versions of the ModelE2 atmos- 
pheric general circulation model, distinguished by their treatment of atmospheric composition and the 
aerosol indirect effect, combined with two ocean general circulation models, HYCOM and Russell. Forc- 
ings that perturb the model climate during the historical period are described. Five-member ensemble 
averages from each of the six versions of ModelE2 simulate trends of surface air temperature, atmos- 
pheric temperature, sea ice and ocean heat content that are in general agreement with observed trends, 
although simulated warming is slightly excessive within the past decade. Only simulations that include 
increasing concentrations of long-lived greenhouse gases match the warming observed during the twen- 
tieth century. Differences in twentieth-century warming among the six model versions can be attributed 
to differences in climate sensitivity, aerosol and ozone forcing, and heat uptake by the deep ocean. 
Coupled models with HYCOM export less heat to the deep ocean, associated with reduced surface 
warming in regions of deepwater formation, but greater warming elsewhere at high latitudes along with 
reduced sea ice. All ensembles show twentieth-century annular trends toward reduced surface pressure 
at southern high latitudes and a poleward shift of the midlatitude westerlies, consistent with 
observations. 


1. Introduction 

Confidence in projections of future climate depends upon our ability to simulate and understand observed 
changes. The two centuries since the start of the industrial revolution offer an increasingly extensive set of 
observations of a warming climate that is apparent in trends of temperature (both at the surface and aloft), 
ocean heat content, and cryosphere. 

The period of 1850 to the present has been chosen for coordinated climate modeling experiments compris- 
ing Phase 5 of the Coupled Model Intercomparison Project (CMIP5) [Taylor et al., 201 2], This is designated as 
the "historical period", whose beginning is chosen prior to significant anthropogenic forcing and climate 
change, but sufficiently recent to contain observations that allow extensive evaluation of models. The his- 
torical experiments are part of a larger set of CMIP5 coordinated experiments that have informed the fifth 
and latest assessment report (AR5) by the Intergovernmental Panel on Climate Change (IPCC), who summa- 
rize our understanding of the changing climate. 
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The main perturbation to climate over the historical period is the change in atmospheric composition: pri- 
marily the increasing concentration of greenhouse gases and aerosols. Long-lived greenhouse gases like 
carbon dioxide show small geographic variations in concentration and can thus be characterized by meas- 
urements at a few sites. However, aerosols and short-lived greenhouse gases like ozone show much larger 
spatial contrasts, challenging our ability to measure their concentration with sufficient spatial and temporal 
detail. The perturbation to the radiative budget by aerosols opposes the effect of long-lived greenhouse 
gases, but the precise offset remains uncertain. Imprecise knowledge of aerosol forcing allows models with 
a range of climate sensitivity to reproduce the warming already observed, despite divergent projections of 
future warming [Kiehl, 2007]. The rate of atmospheric warming is tied to the response of the upper ocean 
(whose depth is defined by the extent of rapid vertical mixing of heat, typically a hundred meters or so). 
Thus, the forced surface response can be delayed by the export of heat to the deep ocean whose tempera- 
ture is not directly tied to that of the overlying atmosphere. This export is not well constrained by observa- 
tions so a model's excessive climate sensitivity can be masked during simulation of the historical period by 
overestimating heat uptake by the deep ocean. In summary, successful simulation of the observed climate 
is not a guarantee of reliable future projections because the agreement may result spuriously from cancella- 
tion of errors in the prescribed forcing, model sensitivity, and heat uptake by the deep ocean. Despite these 
challenges, the ability to simulate observed trends during the historical period remains an important test of 
a climate model. Historical simulations provide the greatest opportunity for constraining model physics and 
weighing model projections for the future, not only by comparison to observed trends, but also observed 
climate processes [ Sherwood et al., 2014], 

ModelE2, the latest generation of the NASA Goddard Institute for Space Sciences (GISS) coupled general cir- 
culation model (CGCM), was prepared for participation in CMIP5. This article, a description of the model 
response to forcing during the historical period, is part of a trilogy. Schmidt et al. [2014a] compare the clima- 
tology of ModelE2 to observations during the satellite era that spans the most recent decades. Projections 
of future warming by ModelE2 in response to anticipated changes in atmospheric composition will be 
described elsewhere. 

This article has three objectives. First, we document the experimental design of the historical simulations 
and the forcings that perturb the climate (sections 2 and 3, respectively). This allows comparable experi- 
ments to be carried out during future model development. In addition, our description of the forcings pro- 
vides a quantitative measure of the radiative imbalance leading to the simulated climate change. Second, 
we compare the model in section 4 to standard global data sets that extend across several decades and 
characterize climate change since the preindustrial period. We emphasize connections between trends of 
different quantities like temperature, ocean heat transport, and sea ice. This integrated comparison is 
intended to provide context for subsequent studies that are more detailed but more narrow in scope. In 
section 4, we address our third goal of identifying how the model response varies as a result of different 
treatments of the ocean circulation and the aerosol indirect effect. Our conclusions are presented in 
section 5. 


2. Model Description 

Here we summarize the main features of ModelE2, referring the reader to Schmidt et al. [2014a] for greater 
detail. 

2.1 . Atmospheric Model 

The ModelE2 atmospheric general circulation model (AGCM) has resolution of 2.5° longitude by 2° latitude 
and 40 vertical layers. The hydrostatic approximation is introduced through the use of pressure as a vertical 
coordinate, with terrain-following sigma coordinates in the 23 layers below 150 mb. Above this pressure 
level, the layer thickness is horizontally uniform to the model top at 0.1 mb. Vertical resolution is enhanced 
near the lower boundary, with six layers below 825 mb, and the tropopause, with nine layers between 251 
and 43 mb. 

Three versions of ModelE2 were used for the CMIP5 experiments, distinguished by their treatment of atmos- 
pheric composition and the aerosol indirect effect (AIE). Aerosols perturb radiative fluxes indirectly by acting 
as cloud condensation nuclei (CCN), altering cloud brightness and lifetime. Specifically, aerosols reduce 
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cloud droplet size by distributing condensed water over a greater number of nucleation sites (the "Two- 
mey" or first AIE) [e.g., Twomey, 1977], and these smaller droplets increase the number of collisions required 
for the droplets to grow to precipitable size (the "Albrecht" or second AIE) [e.g., Albrecht, 1 989]. 

In the first version of the CMIP5 ModelE2, designated as the NINT model version (for its "noninteractive" 
atmospheric composition) and identified as "pi" (physics-version = 1) in the CMIP5 archive, the mass of 
radiatively active atmospheric constituents is prescribed, based upon separate calculations of their spatial 
extent and decadal variation since the preindustrial era [ Shindell et al., 2003; Miller et al., 2006a; Koch et ai, 
201 1]. The magnitude of the aerosol indirect effect is prescribed using an empirical relation between aero- 
sol mass and low cloud cover [ Hansen et al., 2005]. The NINT version is intended for comparison to the pre- 
vious generation (CMIP3) version of ModelE described by Schmidt et al. [2006] and Hansen et al. [2007]. 

The second version of the CMIP5 ModelE2 calculates the atmospheric chemical and aerosol composition 
prognostically as described in section 3, allowing the resulting climate perturbation to feed back upon 
atmospheric composition. The aerosol indirect effect is again prescribed empirically in terms of the aerosol 
mass, as in the NINT experiment. Because of the prognostic tracers of chemistry, aerosols, and their direct 
radiative effect, this model is referred to as TCAD and identified as "p2" (physics-version=2) in the CMIP5 
archive. 

The third model version is identical to the TCAD version, except that the first AIE is parameterized according 
to a simple budget of unactivated cloud condensation nuclei, described in section 3.4.2. There is no repre- 
sentation of the second AIE. This model version is referred to as TCADI for its inclusion of the first indirect 
effect, and is identified as "p3" (physics-version = 3) in the CMIP5 archive. 

The contrast between the NINT and TCAD models shows the effect of prognostic aerosols and chemistry, 
while the TCAD and TCADI contrast shows the importance of a mechanistic versus empirical treatments of 
the aerosol indirect effect. 

The equilibrium climate sensitivity is 2.7, 2.7, and 2.9 K for the NINT, TCAD, and TCADI versions of ModelE2, 
respectively [ Schmidt et al., 2014a], These sensitivities are calculated relative to the preindustrial climate, 
after an instantaneous doubling of C0 2 , using a mixed-layer ocean with prescribed ocean heat transport. 

For comparison, the previous generation (CMIP3) version of ModelE had a sensitivity of 2.7 K [Schmidt et al., 
2006], 

2.2. Ocean Models 

For each of the three versions of the ModelE2 AGCM, we couple one of two ocean GCMs (OGCMs). 

The "Russell" OGCM, originally described by Russell et al. [1 995], and more recently by Hansen et al. [2007], is 
a mass-conserving model whose vertical coordinate is height. Conservation of mass ensures that water 
added as precipitation and runoff is conserved exactly before returning to the atmosphere through evapo- 
ration and sublimation, so that salt is conserved as well. The Russell OGCM has horizontal resolution of 
1 .25° longitude by 1 ° latitude, with four OGCM grid boxes nested within each AGCM grid box. There are 32 
vertical layers extending to a depth of 4990 m. Tracer advection is calculated using a linear upstream 
scheme that updates both the tracer value and its slope within the grid box. The additional calculation of 
the slope maintains tighter gradients against numerical diffusion. Mesoscale mixing is parameterized 
according to the Gent-McWilliams scheme, although the along-isopycnal flux was misaligned resulting in 
excessive cross-isopycnal diffusion. 

The FHybrid Coordinate Ocean Model (FIYCOM) merges layers of constant depth near the surface with iso- 
pycnal layers below [Sun and Bleck, 2006]. The isopycnal coordinate allows cross-isopycnal diffusion of trac- 
ers beneath the surface mixed layer to be kept small compared to along-isopycnal diffusion, as generally 
observed. The total number of constant depth and isopycnal layers is 26. FIYCOM has zonal resolution of 1° 
longitude with meridional resolution of 1/3° at the equator to resolve equatorial waves. The meridional grid 
spacing increases to nearly 1° outside of the equatorial waveguide before decreasing poleward as the 
cosine of latitude. A conventional spherical coordinate grid extends from the Antarctic coast to near 57°N, 
where it merges with a separate rectilinear grid that represents the Arctic Ocean without a polar singularity. 
(In the CMIP5 archive, FIYCOM ocean variables are mapped onto the 1° longitude by 1° latitude Levitus 
grid.) 
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The ModelE2 AGCM coupled to the HYCOM or Russell OGCM is designated within the CMIP5 archive as the 
GISS-E2-H or GISS-E2-R model, respectively. (For brevity, we delete "GISS" in the remainder of this article.) 
Combined with the NINT, TCAD, and TCADI physics versions of the ModelE2 AGCM, the two OGCMs com- 
prise six distinct coupled models. 

2.3. Spin-Up of PI Control Runs and Branching 

The historical experiments from 1850 to the present are initialized from a preindustrial control experiment. 
In this subsection, we describe the spin-up of the control experiment and subsequent branching of the his- 
torical experiments. 

For each of the six coupled models, the ocean is initialized from rest using the Levitus climatological 
distribution of temperature and salinity [ Levitus and Boyer, 1994; Levitus et al., 1994]. The models are 
integrated toward equilibrium with atmospheric composition and solar irradiance held constant at 1850 
values. Greenhouse gas concentrations are prescribed using ice-core measurements [cf. Schmidt et al., 
2011]. Since the initialization from the Levitus climatology, minor changes were occasionally introduced 
to the model physics. The six integrations proceeded without further change for roughly 500 years 
before the start of the preindustrial control runs. This 500 year duration is longer than the few decades 
needed to bring the upper ocean into balance, but insufficient for complete equilibration of the deep 
ocean, which requires a few thousand years. During the preindustrial control experiment, the top-of- 
atmosphere (TOA) energy imbalance in the E2-R model is small and among the top quintile of the 
CMIP5 models surveyed by Forster et al. [2013], Nonetheless, there is slow climate drift throughout this 
period, as described in section 2.4. 

Figure 1 shows that control-run variations of global surface air temperature are small compared to warming 
of approximately 1 K observed over the twentieth century [e.g., Jones et al., 201 2]. In practice, smaller drift 
can be achieved only at the considerable cost of integrating the control runs an additional few thousand 
years until the deep ocean comes into equilibrium. 

Figure 1 shows the segment of each control run that overlaps with the historical and Representative Con- 
centration Pathway (RCP) experiments, with the segment's first and last years listed in Table 1. (These years 
are measured relative to the date of the Levitus initialization, and do not indicate the degree of equilibra- 
tion, due to intervening changes in model physics as described above. We use this dating to be consistent 
with the model output archived in the piControl CMIP5 directory.) Variations in other control-run quantities 
like sea ice are similarly small (not shown), compared to recent observed trends. 

For each of the six coupled models, we carry out an ensemble of five perturbation experiments, starting 
from the segment of the preindustrial control experiment listed in Table 1, and applying various forcing 
agents as described below to simulate the changing climate of the late-nineteenth and twentieth centuries. 
(Certain ensembles in the CMIP5 historical archive also include a sixth member with diagnostics at sub- 
monthly resolution that we do not analyze here.) Individual members of each historical perturbation ensem- 
ble are branched from the control run at 20 year intervals, with 80 years separating the first and last 
members. Twenty years is an estimate of the decorrelation time for coupled experiments with slightly differ- 
ent ocean conditions, and is chosen to reduce the dependence of the ensemble mean upon the initial state 
of the preindustrial ocean, which is sparsely observed, especially below the surface. The branch points of 
the individual ensemble members are marked by blue triangles in Figure 1. In total, we have 30 historical 
experiments of the observed climate between 1 850 and the early twenty-first century, with additional 
ensembles perturbed by subsets of the forcing. 

Each of the six control-run segments listed in Table 1 has a duration exceeding 531 years. This threshold 
consists of the 81 years required to initialize the five historical ensemble members, plus 451 years for com- 
parison to the historical and RCP perturbation experiments that extend from calendar years 1850 to 2300. 

2.4. Definition of Anomalies During the Historical Period 

We define the climate response to forcing since 1 850 by subtracting from each historical experiment a 
smoothed, reference time series from the corresponding preindustrial control run whose slow variation we 
attribute to drift. A common approach is to define the slowly evolving control climate by fitting a linear 
trend [e.g., Forster et al., 201 3]. While this fit might be acceptable over the first century or so comprising the 
historical period, Figure 1 shows that a single, linear trend would fit poorly at several intervals within the 


MILLER ETAL. 


©2014. American Geophysical Union. All Rights Reserved. 


4 


®AGU Journal of Advances in Modeling Earth Systems 10.1 002/201 3MS000266 


GISS-E2-R NINT 



O 287.0 


b) 


4000 4100 4200 4300 4400 

GISS-E2-R TCAD 



c) 


GISS-E2-R TCADI 


288.0 


- 287.8 


Q) 

O 

03 

=3 

0) 


287.6 


75 287.4 


O 287.2 



d) 


4500 


f) 


GISS-E2-H NINT 



e) 


GISS-E2-H TCAD 



GISS-E2-H TCADI 


288.6 


- 288.4 


CJ 

nj 
T I 
Z3 

w 

TO 

O 


288.2 


288.0 


O 287.8 



3600 3700 3800 3900 4000 4100 


2500 2600 2700 2800 2900 3000 


Figure 1 . Global surface air temperature (black) in the preindustrial control simulations that are used to initialize the perturbation experiments. 
The five ensemble members for each perturbation experiment branch from the control experiment at the date marked by the blue triangles. 
Forced anomalies for each perturbation experiment are constructed by removing a smoothed loess fit constructed from the control (red). 


entire 531 year segment, introducing spurious variability into the perturbation time series. For example, drift 
within the E2-FH TCAD control run is rapid during the initial two centuries, but smaller thereafter. As a more 
flexible alternative to a linear fit, we define a slowly evolving control run using a loess fit [ Cleveland and Dev- 
lin, 1988], calculated using the loess function of the R statistical (or stats) library [e.g., Teetor, 201 1], We 
smooth across a span of roughly 200 years to retain slow centennial-scale drift in each control. The 
smoothed evolution of global surface air temperature in the control experiments is shown by the red curve 
in Figure 1, which shows centennial-scale variability along with temporary changes in the sign of the drift 
within certain control runs (e.g., GISS E2-FI NINT). 

Compared to linear detrending, a loess fit offers greater flexibility in defining a slowly varying reference 
time series, when centennial-scale oscillations are present (e.g., Figure 1 b). Flowever, it is not clear for how 
long these oscillations remain in unison between the control and historical experiments, and thus whether 
they should be included within the slowly evolving reference time series that is subtracted to define the 
forced response (Oscillations that are out of phase between the control and forced experiments introduce 
variability into the perturbation that result from chaotic divergence rather than the forcing). We could 
reduce oscillations in the reference time series by increasing the temporal width of the loess smoothing 
window. Flowever, this reduces the ability of the reference time series to match the changing rate of drift 
within each control. Accounting for drift depends upon finding an optimal smoothing duration that simulta- 
neously captures the initial drift while minimizing longer period fluctuations. In any case, the smoothed ref- 
erence time series in Figure 1 shows only small oscillations during the initial 1 56 years corresponding to the 


MILLER ETAL. 


©2014. American Geophysical Union. All Rights Reserved. 


5 


®AGU Journal of Advances in Modeling Earth Systems 10.1 002/201 3MS000266 


Table 1. First and Last Year of the piControl-Run Segment Used to Construct Anomalies During the Historical Period 3 


Model 

CMIP5 Archive Tag 

GISS Model Name 

First Year 

Last Year (Duration) 

E2-R NINT 

piControl_r1i1p1 

E135F40oQ32 

3981 

4530 (550) 

E2-R TCAD 

piControl_r1i1p2 

El 33TcadF40oQ32 

3590 

4120 (531) 

E2-R TCADI 

piControl_r1i1p3 

El 34TcadiF40oQ32 

3560 

4090 (531) 

E2-H NINT 

piControl_r1i1p1 

Eh135 

2410 

2949 (540) 

E2-H TCAD 

piControl_r1i1p2 

Eh137Tcad 

2490 

3020 (531) 

E2-H TCADI 

piControl_r1i1p3 

Eh134Tcadi 

2490 

3020 (531) 


a The first year corresponds to 1850 in the first member of the historical run ensembles, with subsequent ensemble members branch- 
ing every 20 years. 


historical experiments. Thereafter, the forced signal from the longer RCP projections into future centuries 
dwarfs any oscillations in the control experiments. 

Although the 1 56 year historical period originally defined by CMIP5 ends in 2005 [Taylor et al., 201 2], exten- 
sions to the end of 201 2 were performed to allow comparison to observations up until the present day. In 
response, we extended the E2-R TCADI ensemble, along with the NINT ensembles for both ocean models. 
To distinguish the effect of anthropogenic and natural forcing, we carried out additional experiments for 
the historical period with only subsets of the forcings applied. The ensembles of the historical experiments 
analyzed in this article are summarized in Table 2. 


3. Radiative Forcing During the Historical Period 

Radiative forcing is designed to allow quantitative comparison of the climate response expected from 
incommensurate forcing agents, such as changing insolation and atmospheric composition, through the 
agents' common effect upon the radiative budget. Forcing measures the energy imbalance of the planet 
before the climate adjusts to restore equilibrium. This imbalance is removed over multiple time scales as 
the climate adjusts. Different forcing definitions have been proposed that account for rapid, initial adjust- 
ment in order to better anticipate the magnitude of the final surface response. Historically, radiative forcing 
has been characterized by the initial (or "instantaneous") radiative perturbation at the tropopause [e.g., 
Ramaswamy et al., 2001; Forster et al., 2007]. The use of the tropopause (rather than the top of atmosphere 
or TOA) approximates the effect of stratospheric adjustment that occurs long before the surface tempera- 
ture reaches equilibrium [Hansen et al., 1981], a duration of a decade or so that is related to the thermal 
inertia of the upper ocean. The instantaneous tropopause forcing omits the effect of other fast responses, 
involving tropospheric clouds and water vapor, for example [e.g., Koch et al., 2009]. The surface temperature 
responds to the radiative imbalance after these fast adjustments have occurred, and recent definitions of 
"adjusted" or "effective" radiative forcing (ERF) have been proposed to include these fast changes and their 
modification of the initial radiative imbalance [e.g., Hansen et al., 2005; Myhre et al., 2014]. The ERF is typi- 
cally calculated by fixing the forcing agent, along with ocean temperature and sea ice and diagnosing the 
radiative imbalance after the atmospheric column returns to equilibrium, a calculation that must be 


Table 2. Perturbation Experiments During the Historical Period That Started From a Preindustrial Control Run in 1850 3 


Model 

CMIP5 Archive Tag 

GISS Model Name 

First Year 

Last Year 

All Forcings 
E2-R NINT 

historical_r1i1p1 

El 35f9[a-e]F40oQ32 

1850 

2012 

E2-R TCAD 

historical_r1i1p2 

E 1 37T cadf9[a-e] F40oQ32 

1850 

2005 

E2-R TCADI 

historical_r1i1p3 

El 34Tcadif9[a-e]F40oQ32 

1850 

2012 

E2-H NINT 

historical_r1i1p1 

Eh135f9[a-e] 

1850 

2012 

E2-H TCAD 

historical_r1i1p2 

Eh137Tcadf9[a-e] 

1850 

2005 

E2-H TCADI 

historical_r1i1p3 

Eh134Tcadif9[a-e] 

1850 

2005 

Anthropogenic Forcings Only 




E2-R NINT 

historicalMisc_r1 il pi 09 

El 35fa[a-e]F40oQ32 

1850 

2012 

E2-H NINT 

historicalMisc_r1 il pi 09 

Eh135fa[a-e] 

1850 

2012 

Natural Forcings Only 
E2-R NINT 

historicalNat_r1i1p1 

E135fn[a-e]F40oQ32 

1850 

2012 

E2-H NINT 

historicalNat_r1i1p1 

Eh135fn[a-e] 

1850 

2012 


a Each experiment is an ensemble of five members. 
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Figure 2. Instantaneous global radiative forcing at the tropopause (Wm 2 ) in the E2-R NINT ensemble, (a) Individual forcings and (b) Total 
forcing, along with the separate sum of natural (solar plus volcanic aerosol plus orbital) and anthropogenic forcings. 


continually repeated as the forcing agent changes with time (e.g. through varying concentration). Alterna- 
tively, the ERF is calculated by regressing temperature against the TOA radiative imbalance [Gregory et ai, 
2004], In practice, the ERF includes a spurious offset caused by the adjustment of temperature over land 
that is rapid compared to the ocean response. This offset is difficult to identify and remove on a regional 
scale [ Shindell et ai, 201 3a], Thus, the IPCC continues to use both the instantaneous tropopause value (their 
"Radiative Forcing") along with the ERF [Myhre et ai, 2014]. Because our focus is the response of climate to 
changing forcing agents since the preindustrial, we will generally characterize the forcing in terms of its 
(time-evolving) instantaneous tropopause value. Flowever, to characterize the present-day forcing, along 
with the effect of aerosols upon cloud radiances, we will make use of the ERF, as described below. 

Figure 2 shows the evolution of the global-average instantaneous forcing at the tropopause, calculated for 
the E2-R NINT ensemble. The tropopause is defined using the standard lapse-rate definition [M/. M. O., 

1 957], Inputs to the radiative calculation such as temperature, water vapor, cloud cover, and cloud optical 
thickness are saved from the E2-R NINT control experiment every two-and-a-half hours (the interval 
between successive radiative calculations in ModelE2) during the single year of 1850, before the model has 
been perturbed by any forcing agent. The forcing is calculated as the perturbation to the net radiative flux 
at the tropopause following a model change such as increased greenhouse gas concentration. The radiative 
forcing evolves in time with the evolution of the forcing agent. Calculation of forcing using the atmospheric 
state during 1850 neglects the effect of interannual variations in temperature or cloud cover, but we expect 
the resulting forcing uncertainty to be small. 

In the remainder of this section, we discuss forcing by individual agents. Data used to prescribe or calculate 
each forcing agent are summarized in Table 3. 


Table 3. Summary of Inputs Used to Prescribe (NINT) or Calculate (TCAD and TCADI) Atmospheric Composition and Forcing 

Forcing Agent NINT TCAD and TCADI 


Long-lived GHG 
C0 2 

ch 4 , n 2 o, cfc 

Ozone 


Stratospheric H 2 0 
Solar irradiance 

Land use 
Orbital 

Tropospheric aerosols 

Snow albedo 
Volcanic aerosols 


Observations [update from Hansen et al., 2007] 

Observations [Hansen et al., 2007] Observations (surface only) 

Seasonal cycle from Shindell et al. [2003] with linear decrease of SH stratospheric 1850-2000: precursor emissions from Lamarque et al. [2010], 
values from 1 979-1 997, thereafter constant to 2005 [Hansen et al., 2007]. and thereafter from RCP4.5 [Lamarque et al., 201 1], with 

volcanic aerosols from update of Sato et al. [1 993]. 

Augmented in proportion to (prescribed) surface CH 4 [Hansen et al., 2007]. Augmented using prognostic CH 4 oxidation [Shindell et al., 201 3b]. 

TSI from an updated version of Wang et al. [2005] with spectral variations from Lean [2009]. For the years 2000-2005 
along with 201 1 and 201 2, TSI is prescribed using a recent solar cycle. 

1850-1900: transition from Pongratzet al. [2008] to HYDE3.0 (used thereafter). 

None Berger [1 978] 

1 850-2000: concentration from Miller et al. [2006a] and Koch et al. [201 1 ], 1 850-2000: emission from Lamarque et al. [201 0], and thereafter 

and thereafter using year-2000 concentration. from RCP4.5 [Lamarque et al., 201 1]. 

From black carbon aerosol deposition. 

Aerosol concentration and size from update of Sato et al. [1993]. 
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3.1. Long-Lived Greenhouse Gases 

The long-lived greenhouse gases C0 2 , CH 4 , N 2 0, and chlorofluorocarbons (CFCs) have concentrations that 
are almost regionally invariant due to their long lifetime and small annual increment. Regional variations in 
monthly averaged concentration of C0 2 retrieved from satellite are 20 ppm or less compared to the present 
global value near 400 ppm [Chahine et al., 2008]. Routine in situ measurements for C0 2 are available after 
1 958 and only more recently for the other long-lived greenhouse gases. Prior to direct atmospheric sam- 
pling, past concentration is measured from ice core bubbles [e.g., Etheridge et at., 1996]. 

For the historical experiments described here, we specify the C0 2 concentration based upon measure- 
ments. [Simulations with specified emission are described by Romanou et al., 201 3.] We approximate the 
concentration of C0 2 as globally uniform in the absence of regional information prior to the most recent 
decades. Values measured in situ and from the Antarctic ice cores are converted to a global average using 
climatological spatial variations revealed by the present-day global network. Construction of the global 
average concentration is described in the supporting information for Hansen and Sato [2004], while 
updated values are tabulated at http://data.giss.nasa.gov/modelforce/ghgases/. In the NINT ensembles, con- 
centrations of the other long-lived species CFL,, N 2 0, and CFCs are prescribed to be slightly larger in the 
Northern Flemisphere (NH), based upon analysis of surface measurements by Minschwaner et al. [1998]. Pre- 
scribed concentrations of these species are vertically uniform within the troposphere, but fall off exponen- 
tially above the tropopause with a scale height of a few tens of kilometers [Hansen et al., 2005]. In contrast, 
the C0 2 concentration is assumed to be uniform throughout the entire model column. The regional and 
vertical distribution of all greenhouse gases are held fixed as their concentrations rise. 

For the TCAD and TCADI ensembles, the C0 2 concentration is prescribed as in the NINT experiments. Concentra- 
tions of other long-lived greenhouse gases (CPI4, N 2 0, and CFC) are prescribed from observations only at the sur- 
face. Their concentration above is calculated as a result of gas-phase chemistry and transport [Shindell et al., 201 3b], 

Radiative forcing by long-lived greenhouse gases was calculated with the previous (CMIP3) version of Mod- 
elE and compared to fluxes derived from line-by-line (LBL) models with substantially greater spectral resolu- 
tion [Collins et al., 2006], Different LBL models give virtually identical results, suggesting that they can serve 
as a benchmark for assessing the accuracy of GCM radiation codes. With forcing defined as the flux differ- 
ence between the present day and preindustrial, the ModelE and LBL longwave forcing differ by only 7% at 
the tropopause for the example of a midlatitude atmospheric column, while the net forcing difference is 
less than 1% due to offsetting differences in the shortwave and longwave flux. 

The CMIP5 forcing associated with long-lived greenhouse gases is larger by nearly 15% compared to the 
CMIP3 model; most of this difference is outside the Tropics (Figure 3). At present, we have not identified the 
origin of this contrast. The CMIP5 ModelE2 radiation module has been updated using the latest version of 
the FIITRAN spectroscopic database [ Rothman et al., 2009]. Flowever, forcing associated with a doubling of 
C0 2 is nearly identical between the CMIP3 and CMIP5 models [Hansen etal., 2005; Schmidt et al., 2014a], 

This suggests that the discrepancy is due to the other greenhouse gases or their spectral overlap with C0 2 , 
with a possible contribution from the different reference climates used to calculate the forcing. 

3.2. Ozone 

Tropospheric ozone has increased during the past century due to anthropogenic emission of ozone precur- 
sors like methane, nitrogen oxides, carbon monoxide, and volatile organic compounds [Seinfeld and Pandis, 
2006]. Seasonal variations of tropospheric ozone are prescribed in the NINT ensembles based upon offline 
calculations by Shindell et al. [2003]. Decadal changes result from time-dependent emissions that are 
updated every quarter-century between 1 850 and 1 950 and every decade thereafter through the 1 990s. 

The NINT distribution of tropospheric ozone is identical to that used in the CMIP3 simulations, and its tem- 
poral trends as a function of latitude, altitude, and season are shown in Figure 2 of Hansen et al. [2007]. 
Ozone loss in the polar stratospheric vortex due to ozone-depleting chemicals like CFCs is prescribed as a 
decreasing linear trend between 1979 and 1997 based upon measurements [Randel and Wu, 1999]. There- 
after, stratospheric ozone is held constant until 2005 before linear restoration to its 1979 value by 2050. The 
prescribed trend is largest in the stratosphere but extends into the troposphere at high latitudes. 

In the TCAD and TCADI ensembles, ozone is fully prognostic [Shindell et al., 2013b], Emission of ozone pre- 
cursors is prescribed using Lamarque et al. [2010] until 2000, and the RCP4.5 scenario thereafter [Lamarque 
et al., 2011], Stratospheric ozone is destroyed by the addition of chlorine radicals derived from 


MILLER ETAL. 


©2014. American Geophysical Union. All Rights Reserved. 


8 


®AGU Journal of Advances in Modeling Earth Systems 10.1 002/201 3MS000266 


a) CMIP3 GHG Forcing (2000-1880) 
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b) CMIP5 GHG Forcing (2000-1880) 
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Figure 3. Instantaneous radiative forcing at the tropopause (Wm 2 ) due to 
long-lived greenhouse gases for the period 1880-2000 in the CMIP5 (E2-R 
NINT) and CMIP3 (GISS-ER) [Hansen et ai, 2007] versions of ModelE. (a) CMIP3 
forcing (calculated using the 1880 climate as reference), (b) CMIP5 forcing 
(calculated using the 1850 climate as reference), and (c) zonal-mean forcing. 


anthropogenic emission of CFCs. This 
destruction is exacerbated after volcanic 
eruptions when heterogeneous chemical 
reactions on the surface of volcanic aerosols 
increase the concentration of chlorine radi- 
cals. During solar cycles, when UV irradi- 
ance temporarily increases, the ozone 
concentration increases in both the strato- 
sphere and upper troposphere [Shindell 
et ai, 2006b]. Both the volcanic and solar 
perturbations to the ozone concentration 
and their attendant climate response are 
present only in the TCAD and TCADI 
ensembles. 

Ozone absorbs both solar and thermal 
radiation, warming the stratosphere and 
troposphere. The global radiative forcing 
is positive during the past century (Figure 
2a) because the tropospheric increases 
overcome the effect of stratospheric 
ozone loss. Figure 4a shows the regional 
distribution of instantaneous radiative 
forcing at the tropopause for ozone in the 
E2-R NINT ensemble. (The forcing will be 
modified quantitatively due to shortwave 
absorption during stratospheric adjust- 
ment, but the regional forcing distribution 
shown in the figure will remain approxi- 
mately the same.) Ozone forcing exhibits 
a strong interhemispheric contrast with 
positive forcing associated with the large 
NFH increase of tropospheric ozone con- 
centration due to anthropogenic emission 
of precursors. This is offset by negative 
forcing from the expanding ozone hole in 
the Southern Flemisphere, where the win- 
ter stratosphere is thermally isolated and 
primed for ozone destruction [Shindell 
et ai, 2013b], 


The global radiative forcing by ozone pre- 
scribed for the E2-R NINT ensemble is 0.45 

WirT 2 in the year 2000 (Figure 4a). SFI stratospheric ozone depletion is slightly larger in the E2-R TCADI ensem- 
ble compared to the observed trend used to specify the NINT concentration, consistent with the smaller TCADI 
ozone forcing of 0.39 WrrT 2 . This contrast of the NINT and TCADI instantaneous forcings will be amplified dur- 
ing stratospheric adjustment, due to initial radiative cooling in the latter ensemble that is nearly twice as large 
(cf. Pyle et al., 2005; Box 1 .3). 


3.3. Stratospheric Water Vapor From Methane Oxidation 

Oxidation of methane contributes significantly to stratospheric water vapor [e.g., LeTexier et ai, 1 988; Tren- 
berth et ai, 2007], more than doubling its concentration in the upper stratosphere, according to calculations 
by Hansen et ai [2007]. In the TCAD and TCADI ensembles, methane is a prognostic variable and oxidation is 
calculated explicitly. In the NINT ensembles, this contribution is implemented empirically, as in the CMIP3 
model [ Hansen et ai, 2007], Stratospheric water vapor is augmented according to the surface methane con- 
centration, with a prescribed 2 year lag that accounts for transport to the stratosphere and subsequent 
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NINT Instantaneous Forcing at Tropopause (2000) 
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Figure 4. Regional distribution of instantaneous radiative forcing (Wm~ 2 ) at the tropopause for the E2-R NINT ensemble by (a) ozone, (b) aerosols on snow and ice, (c) land use, and (d) 
all forcings simultaneously present. All values are for the year 2000 compared to 1850. Forcing is calculated at 8 C latitude by 10° longitude resolution for computational efficiency so 
some regions of land-use forcing appear to extend offshore. 


photochemistry. To prescribe the magnitude along with the height and latitude dependence of this increase, 
the equilibrium response of stratospheric water vapor was calculated using a stratospheric transport model 
[Hall and Prather, 1 995], given a tropospheric methane concentration of 1 740 ppbv. Oxidation and conver- 
sion to water vapor were modeled as a linear relaxation of the methane concentration, whose rate was taken 
from a zonally averaged model with more inclusive stratospheric photochemistry [Fleming et a!., 1 999]. 

Calculation of the forcing due to oxidation of methane is complicated, because several years elapse before 
the water vapor originating as surface methane circulates throughout the stratosphere. The forcing by strat- 
ospheric water vapor is not simply related to the current tropospheric methane concentration, but is a con- 
volution of methane during the previous decade. Hansen et al. [2005] use the CMIP3 version of ModelE to 
calculate the equilibrium forcing by stratospheric water vapor following an abrupt increase in the surface 
methane concentration, estimating that the twentieth-century forcing is below 0.1 WirT 2 . We could simi- 
larly estimate this forcing in the (CMIP5) ModelE2, but because our methane concentration in the E2-R NINT 
ensemble is identical, we assume that the forcing remains small. 

Solomon et al. [2010] propose that reduced surface warming observed during the recent decade is partly a 
consequence of a declining water vapor trend in the lower stratosphere, where methane oxidation is small, 
and water vapor upwells through the tropical cold trap. This feedback shows that our methane-derived 
forcing is an incomplete measure of the total climate effect of stratospheric water vapor. 


3.4. Tropospheric Aerosols 
3.4.1. Direct Radiative Forcing 

Tropospheric aerosols in ModelE2 consist of sulfate, nitrate, organic carbon, black carbon, sea salt, and soil 
(or "mineral") dust. In the NINT ensembles, dust is prescribed using the climatological seasonal cycle calcu- 
lated by Miller et al. [2006a]. The spatial and temporal distributions of the other aerosol species are pre- 
scribed from a separate transient simulation of the late nineteenth and twentieth century with the CMIP3 


MILLER ETAL. 


©2014. American Geophysical Union. All Rights Reserved. 


10 


®AGU Journal of Advances in Modeling Earth Systems 10.1 002/201 3MS000266 


version of the GISS ModelE-R [Koch et al., 201 1]. This 
simulation included prognostic budgets of atmospheric 
chemical species and aerosols coupled by gas-phase 
chemistry and heterogeneous reactions on dust par- 
ticles between sulfate and nitrate precursors and ozone 
[Bell et al., 2005; Bauer and Koch, 2005; Shindell et al., 
2006a; Koch et al., 2006; Koch et al., 2007; Bauer et al., 
2007]. Emission was prescribed and updated once each 
decade starting in the 1890s using community data- 
bases [e.g., van Aardenne et al., 2001], so that direct and 
indirect forcing by anthropogenic species are zero prior 
to this date. The CMIP3 model has 4° latitude by 5° lon- 
gitude horizontal resolution and 20 vertical layers. The 
profile in each CMIP3 column was used directly without 
horizontal interpolation for each of the four CMIP5 
model columns nested within. Remapping to the higher 
CMIP5 vertical resolution was conservative, assuming 
that the concentration is piecewise constant within 

each CMIP3 layer. 

In the TCAD and TCADI ensembles, concentrations of aerosols and atmospheric chemical species are calcu- 
lated prognostically and differ from the prescribed NINT values for several reasons. First, an updated emis- 
sion inventory is used for anthropogenic aerosols and biomass burning [Lamarque et al., 2010]. Second, 
aerosol physics is more extensive in the current version of ModelE2, with the introduction of secondary 
organic aerosols, for example. Third, the climate model itself has changed, most notably with doubled hori- 
zontal and vertical resolution since the calculations used to prescribe the NINT concentrations. In the TCAD 
and TCADI models, nitrate mass is multiplied by 0.2 in the radiative calculation to account for concentra- 
tions that are believed to be excessive. Black carbon mass is multiplied by 1 .5 to account for coatings of sul- 
fate and nitrate that increase shortwave absorption through internal reflection [Bauer et al., 2007], 

In the E2-R NINT ensemble, direct radiative forcing by aerosols in the year 2000 is —0.19 and 0.00 WirT 2 at the 
tropopause and TOA, respectively (Table 4). (Our definition of aerosol forcing includes only anthropogenic spe- 
cies, and excludes dust and sea salt.) Direct forcing at TOA in the E2-R TCAD and TCADI ensembles is slightly 
more negative at -0.06 and -0.05 WirT 2 , respectively. Aerosol forcing is sensitive to the reference climate 
used to calculate the radiative fluxes. For example, Shindell et al. [201 3a] calculated the shortwave component 
of aerosol direct forcing at TOA in the E2-R TCADI ensemble as -0.49 WirT 2 . Their forcing is calculated prog- 
nostically using the "double call" method, where the flux difference including and omitting aerosols is calcu- 
lated using the instantaneous climate. That is, aerosol forcing during the year 2000 is calculated relative to the 
climate at that same point in time. In contrast, we calculate our forcing relative to the start of the historical 
period in 1850 (while including both shortwave and longwave contributions). Table 5 shows the result of addi- 
tional calculations that were carried out to reconcile the 
difference in TCADI aerosol direct forcing between our 
method and that of Shindell et al. [2013a]. Recalculating 
our net flux difference using the year-2000 climate 
reduced the TOA forcing to -0.41 WirT 2 , while the 
shortwave contribution alone was -0.48 WrrT 2 , nearly 
identical to the value calculated by Shindell et al. [2013a], 
Koch et al. [2011] similarly found that NINT aerosols in the 
year 2000 result in TOA direct forcing of -0.40 WirT 2 
when using the double-call method (compared to our 
value of 0.00 WirT 2 based upon the 1850 climate). Table 
5 shows that aerosol direct forcing is highly sensitive to 
the climate used to calculate the radiative fluxes. The 
same aerosol distribution that would barely perturb the 
TOA flux in 1 850 results in a much larger forcing magni- 
tude by the end of the historical period. The aspect of 


Table 5. TCADI Global-Mean Direct Aerosol Forcing at 
TOA and Its Dependence Upon the Climate Used to 

Calculate the Radiative Fluxes 3 

Forcing 

Experiment 

(WrrT 2 ) 

Net relative to 1850 climate (Figure 5b) 

-0.05 

Net relative to 2000 climate 

-0.41 

Shortwave only, relative to 2000 climate 

-0.48 

Shindell et al. [201 3a] shortwave 

-0.49 


Additional differences in the global-mean forcing 


between Figure 5b and Shindell et al. [2013a] result from 
differences in temporal or ensemble averaging. Figure 5b 
is calculated using aerosol concentration from a five- 
member ensemble mean, while Shindell et al. [201 3a] 
uses a single ensemble member averaged over an 1 1 
year period centered about the year 2000. 


Table 4. Comparison of Forcing for the Year 2000 Within 
the E2-R NINT, TCAD and TCADI Ensembles Calculated 


Relative to 1850 3 

NINT 

TCAD 

TCADI 

Ozone* 


0.45 

- 

0.39 

Tropospheric Aerosol 
Direct 

TOA 

0.00 

-0.06 

-0.05 

Tropopause 

-0.19 

-0.19 

-0.19 

Surface 

-0.77 

-0.65 

-0.69 

Indirect 

TOA 

-0.66 

-0.64 

-1.05 

Tropopause 

-0.67 

-0.66 

- 

Surface 

-0.42 

-0.40 

- 

Effective RF 

TOA 

- 

- 

-1.10 


a The TCADI ERF is provided by Shindell et al. [201 3a]. 
*Ozone forcing represents the instantaneous value at 
the tropopause. 
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Figure 5. Regional comparison of aerosol radiative forcing (WrrT 2 ) at TOA between the GISS E2-R NINT and TCADI ensembles, (a and b) direct radiative forcing, (c and d) indirect effect, 
(e and f) total aerosol forcing. For the TCADI ensemble, the total aerosol forcing is taken from the effective radiative forcing (ERF) [Shindell et at, 201 3a], and the indirect effect is diag- 
nosed by subtracting the direct radiative forcing from the ERF. All values are for the year 2000 compared to 1850. Forcing is calculated at 4° latitude by 5° longitude resolution, while the 
TCADI ERF is calculated at twice this resolution. 


climate change that causes this increase in forcing magnitude deserves further investigation. 

Accounting for differences in the method of calculation (based upon Table 5), our aerosol direct forcing for 
all E2-R ensembles is within the range of —0.58 and -0.02 WrrT 2 calculated for Phase II of the Aerosol 
Comparisons between Observations and Models ("AeroCom") Project [ Myhre et at., 201 3], Direct forcing in 
this project includes only the shortwave component and was calculated using meteorological variables 
from 2006. Our forcing is also consistent with the Atmospheric Chemistry and Climate Model Intercompari- 
son Project (ACCMIP) best estimate of -0.33 to -0.50 Wrrfo 2 [Shindell et al., 2013a], 

The regional contrast between the E2-R NINT and TCADI direct forcing is shown in Figures 5a and 5b. 
Because of large spatial contrasts in aerosol concentration, we calculate the forcing at higher resolution (4° 
latitude by 5° longitude) compared to the other forcing agents. Although the global-mean forcing is similar 
in the two models, there are large regional differences. In the NINT ensemble, large negative forcing occurs 
in populous and industrialized regions such as Europe along with eastern North America and Asia. In con- 
trast, the TCADI ensemble has positive forcing in these latter two regions. The TCADI model shows large 
negative forcing near Indonesia that has no counterpart in the NINT ensemble. Aerosol forcing by the TCADI 
ensemble along with the other ACCMIP models was compared to an extensive set of observations by 
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Shindell et at. [201 3a]. These models generally reproduce the observed total aerosol optical thickness, 
despite its systematic underestimation over Asia. 

The deposition of black carbon aerosols upon snow decreases the surface albedo and increases solar 
absorption. Its global effect (referred to as "Snow Albedo" in Figure 2) is small in both the E2-R NINT and 
TCADI ensembles, but can dominate the total forcing in specific regions like the Himalayas and Tibetan Pla- 
teau (Figures 4b and 4d), where there is a bright, snowy surface, high solar zenith angle, and a significant 
nearby source of anthropogenic aerosols. Bauer et at. [201 3] provided additional description of black carbon 
deposition upon snow in the TCADI ensembles and comparison to measurements. 

None of the ensembles allow temporal variations in dust sources. A recent attribution based upon satellite 
retrievals suggest that one-guarter of the present-day dust load originates from anthropogenic sources cre- 
ated by cultivation and grazing [Ginoux et at., 2012]. This suggests that dust source extent has varied mark- 
edly since the preindustrial era. Mahowald et at. [2010] used ice cores and other deposition records to 
estimate that the dust load may have doubled during the twentieth century. For comparison, the global 
dust load in the TCAD and TCADI ensembles shows significantly smaller variations (on the order of 1 0%) 
resulting from climate variations that influence dust emission, transport and removal. 

The emission inventories used to prescribe or calculate aerosol concentration in the historical simulations 
extend only to the end of the twentieth century [van Aardenne et at., 2001 ; Lamarque et at., 2010]. Between 
the years 2000 and 2005, the emission inventories in the TCAD and TCADI experiments are taken from the 
RCP4.5 scenario [Lamarque et at., 201 1]. For the NINT experiments, aerosol concentrations are held constant 
after the year 2000. 

3.4.2. Aerosol Indirect Effect 

In the NINT and TCAD ensembles, the magnitude of the AIE is tuned using an empirical relation between 
low cloud cover and the logarithm of aerosol number concentration [Hansen et at., 2005, 2007]. The loga- 
rithmic dependence results in a stronger indirect effect in regions of pristine air. Aerosol number concentra- 
tion is derived from aerosol mass using prescribed particle size and solubility. Dust and salt aerosols do not 
contribute to the AIE due to their presumed small anthropogenic component. Aerosols modify only low 
cloud cover, and only where these clouds would have formed in the absence of aerosols. (This prevents 
aerosols from creating clouds where humidity is low.) Because the NINT ensembles were designed to be 
compared with the corresponding CMIP3 experiments, we intended the AIE at the tropopause to be near 
the value of —1 WirT 2 used by Hansen et at. [2007]. However, the precise magnitude of this forcing cannot 
be specified a priori because it depends upon the simulated climate, especially the cloud distribution. For 
the E2-R NINT ensemble in the year 2000, the actual instantaneous AIE at the tropopause is -0.67 WrrT 2 
(Table 4), with its regional dependence shown in Figure 5c. For the E2-R TCAD ensemble, this forcing is 
slightly smaller at -0.66 WirT 2 , with a nearly identical regional distribution (not shown). Note that while 
the TCAD direct forcing is calculated using prognostic aerosol concentration, the TCAD AIE is calculated 
using the same offline aerosol concentration prescribed in the NINT ensembles. 

In the TCADI ensembles, the first AIE is represented mechanistically using a prognostic budget for the num- 
ber of unactivated cloud condensation nuclei. This number is a balance between aerosol supply and loss by 
wet deposition, among other processes [Morrison and Gettelman, 2008; Menon et at., 2010], Only the first 
indirect effect is represented; the autoconversion rate depends upon only the cloud water content, and is 
not reduced with decreasing droplet size. At present, larger aerosols like dust do not nucleate ice crystals. 

For the E2-R TCADI ensemble, an ERF of - 1 .1 0 WrrT 2 is calculated by fixing ocean temperature (at its 1 850 
value) and allowing the model to come into equilibrium with the emission of anthropogenic aerosols pre- 
scribed for the year 2000 [Shindell et at., 201 3a]. The ERF also includes the semidirect effect [Hansen et at., 1 997; 
Perlwitz and Miller, 2010], along with changes to other variables such as water vapor and the lapse-rate in 
response to the introduction of aerosols. We define the TCADI AIE as the difference between the ERF and the 
direct radiative forcing by the aerosols. This contrast is somewhat inconsistent because the aerosol concentra- 
tion used to calculate the ERF differs slightly from that used for the direct forcing, since the former is based 
upon prescribed year-2000 emission but a climate consistent with 1850 ocean temperatures. (The concentra- 
tion used to calculate direct forcing results from both emission and climate taken from 2000.) In any case, the 
resulting AIE at TOA is - 1 .05 Wm ~ 2 , whose magnitude is almost twice the E2-R NINT value (Table 4). 
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Because of its inclusion of other feedbacks, the ERF of the TCADI model is only approximately comparable 
to the total forcing of the NINT and TCAD ensembles that is the sum of the direct forcing and the AIE. With 
these caveats, the aerosol ERF and total forcing are most negative in the TCADI ensemble, but all are within 
the large range of -0.9 ± 1.0 WrrT 2 compiled from observational estimates and other CMIP5 models 
[Myhre et al., 2014]. 

In the E2-R NINT and TCAD ensembles, the magnitude of AIE is largest over eastern Asia, collocated with 
large direct radiative forcing (Figure 5c). (Because of the similarity of the regional distributions of NINT and 
TCAD forcing, only the former are shown in the figure.) The AIE of the E2-R TCADI ensemble, calculated 
from the difference of the ERF and direct forcing, is largest over Indonesia (Figure 5d). This corresponds to a 
region where the magnitude of direct forcing is large, but the AIE is also large within the North Pacific and 
North Atlantic storm tracks that are downwind of large direct forcing. Similarly, the TCADI AIE is large just 
offshore of the region of biomass burning over West Africa. The TCADI experiments show reduced errors in 
cloud fraction in these regions [Schmidt et at., 2014a], although confident attribution of cloud anomalies to 
the presence of aerosols requires a more careful analysis. 

Because it is the ERF rather than the AIE that is directly calculated for the TCADI model, we show the former 
along with the total forcing for the NINT model in Figures 5e and 5f. Because of the small magnitude of the 
direct effect in both models (at least when calculated relative to the 1850 climate), the ERF and total forcing 
are strongly influenced by the interaction of aerosols with clouds. 

3.5. Volcanic Aerosols 

Forcing by volcanic aerosols in the stratosphere depends upon the optical thickness and effective radius of 
the sulfate droplets created by oxidation of sulfur dioxide injected above the tropopause. For all ensembles, 
we specify volcanic aerosol properties in the stratosphere following major eruptions, along with a time- 
invariant background state that represents the effect of ubiquitous smaller eruptions that maintain a mini- 
mum aerosol concentration between major events. For both components, the stratospheric aerosol optical 
thickness and effective radius as functions of latitude and height are prescribed using a compilation 
updated from Sato et al. [1 993], available at http://data.giss.nasa.gov/modelforce/strataer/. The aerosol 
effective radius is typically 0.2 /rm but can grow as large as 0.6 gm in the Tropics in the first year following a 
major tropical eruption before relaxing to smaller values. Aerosol properties for eruptions within the satellite 
era are constrained by measurement, but values from prior eruptions are characterized by extrapolating 
from measurements following El Chichon and Pinatubo. Hansen et al. [2007] estimates the forcing uncer- 
tainty as 20% for Pinatubo and as large as 50% for Krakatau. At the time we carried out the historical inte- 
grations, we had not updated the volcanic aerosol compilation after the year 2000. For subsequent years 
through 2005, we held aerosol properties constant at their year-2000 value. 

3.6. Solar Irradiance 

Total solar irradiance (TSI) varies with an approximate 1 1 year cycle, but longer term variations in the cycle 
amplitude and minimum value (or "background") are known less precisely and still debated [ Schmidt et al., 
2011, 2012]. Variations in irradiance over the cycle are nearly in phase at all wavelengths, with the largest 
variations at shorter wavelengths near the ultraviolet [Lean, 2009]. 

While TSI has been measured from space since 1978, radiance measurements were less reliable between 
mid-1 989 and the fall of 1 991 , and there are multiple reconstructions of a continuous multidecadal satellite 
record of TSI. Prior to the satellite measurements, TSI and its spectral dependence must be inferred from 
proxies like visible images and counts of sunspots. 

For the historical period starting in 1850, we use the TSI reconstruction of Wang et al. [2005] as recom- 
mended for the CMIP5 experiments (http://sparcsolaris.gfz-potsdam.de/cmip5.php). This postulates smaller 
changes in the background over the historical period, and thus smaller solar forcing, compared to Lean 
[2000] that was used for the GISS CMIP3 simulations [Hansen et al., 2007]. Between 2000 and 2005, TSI is 
prescribed by repeating the previous 1 1 year cycle instead of using measurements. Spectral variations are 
taken from Lean [2009]. Recent observations of out-of-phase behavior [Harder et al., 2009] remain to be con- 
firmed [Lean and Deland, 2012], and were not used in this study. 

During the last decade of the twentieth century, global forcing by TSI relative to 1850 ranges between zero 
and 0.14 WrrT 2 (Figure 2a). The forcing trend during the entire historical period is slightly positive, but 
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Table 6. Instantaneous Radiative Forcing at the Tropopause (WrrT 2 ) in 2003 
Relative to 1880 for the CMIP5 E2-R NINT Ensemble Compared to Forcing in the 
Previous Generation (CMIP3) GISS ModelE 3 


ModelE-R ModelE2-R NINT 

Forcing (CMIP3) (CMIP5) 


Well-mixed greenhouse gases 

2.62 

2.99 

Ozone 

0.44 

0.45 

Solar irradiance 

0.23 

0.11 

Land use 

(-0.09) 

-0.14 

Snow albedo 

0.05 

0.06 

Orbital 

- 

<0.003 

Tropospheric aerosols (direct) 

-0.41 

-0.19 

Tropospheric aerosols (indirect) 

-1.00 

-0.67 

Volcanic aerosols 

0.00 

-0.14 

Sum 

1.84 

2.47 


a The period of 2003 to 1880 is chosen to allow comparison to the CMIP3 values 
in Table 1 of Hansen et al. [2007]. Due to the omission of the CMIP3 instantaneous 
forcing for land use, we use the ERF value (denoted as F s in that study). For strato- 
spheric water vapor from methane oxidation, we assume that the CMIP5 forcing 
is comparable to the value of 0.1 Wm~ 2 calculated for the CMIP3 model. 


remains small compared to that of 
long-lived greenhouse gases, a com- 
parison that remains robust when 
forcing is calculated from other TSI 
reconstructions. 

3.7. Orbital Variations 

Orbital parameters including eccen- 
tricity, obliquity, and the date of peri- 
helion vary over the course of the 
TCAD and TCADI historical ensem- 
bles according to Berger [1978]. (We 
inadvertently omitted orbital forcing 
from the NINT ensembles, so the 
value in Table 6 represents the 
importance of this neglected forc- 
ing.) The associated forcing since 


1850 is small (with a global value of 
less than 0.003 Wm~ 2 , and seasonal 

differences are less than 1 WirT 2 in magnitude at any latitude. Nonetheless, we include orbital forcing in 


the historical runs to allow a coherent comparison to CMIP5 runs for the last millennium where regional var- 
iations of insolation compared to the present day are as large as 5-1 0 WirT 2 in magnitude. 


3.8. Land Use 

After 1900, we specify the location of cropland and pasture using the HYDE3.0 land-use and land-cover 
data base [Klein Goldewijk and van Drecht, 2006]. This data base was not available in time for the CMIP5 
ModelE2 simulation of the last millennium, which instead uses Pongratz et al. [2008]. Although HYDE3.0 is 
available back to the beginning of the historical period, we blend these two data sets for the period 
between 1850 and 1900 using linear weighting with respect to time for continuity of forcing between the 
last millennium and historical experiments. 

Land-use forcing due to changes in surface albedo is as large in some cultivated regions as that due to 
increasing greenhouse gas concentrations (Figure 4c). The forcing is negative, because fields cleared for cul- 
tivation and grazing are typically brighter than the natural vegetation they replace. Figure 4c shows clearing 
of the rain forests in Indonesia and Central America, along with contraction of the boreal forests in the west- 
ern plains of Canada. Conversely, there is positive forcing in the northeastern United States where farms 
abandoned during the last century are being reclaimed by forest succession. Global average land-use forc- 
ing in the E2-R NINT ensemble is -0.19 WrrT 2 at the tropopause by the start of the twenty-first century, 
comparable to the direct forcing by tropospheric aerosols. 

The historical experiments do not include prescribed changes in irrigation that have been subsequently 
introduced into ModelE2, and that reduce surface warming by increasing evaporation over agricultural 
regions (B. Cook et al., Irrigation as an historical climate forcing, submitted in 2013 to Climate Dynamics). 


3.9. Extension of Historical Runs From 2005 to Present Day (2012) 

We extended both of the NINT ensembles and the E2-R TCADI ensemble to allow their comparison to obser- 
vations up until the present day. When the ensemble extensions were carried out, forcing observations 
were available only through 2010. For the years 201 1 and 2012, we used RCP4.5 to prescribe concentrations 
of well-mixed greenhouse gases. For these years, solar irradiance was taken from 1 998 and 1 999: analogous 
years in a recent solar cycle. (The effect of taking values from previous cycles in lieu of observations resulted 
in a slight overestimate of solar forcing since 2000.) We used an updated atlas of volcanic aerosol properties 
between 2006 and 2010 [cf. Sato et al., 1993]. No volcanic eruptions were prescribed for 201 1 and 2012, 
when volcanic aerosol concentration was limited to a small background value. 

In the NINT extensions, tropospheric aerosols subsequent to 2005 were fixed at year-2000 concentrations, 
the last year in the Koch et al. [2011] simulations. Stratospheric ozone, held constant since 1997, began a 
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slow recovery after 2005 [Hansen et al., 2007]. For the TCADI extension, tropospheric aerosols and ozone 
were calculated using emission inventories from RCP4.5 [Lamarque et al., 201 1], 

Because observed solar irradiance and volcanic aerosol concentrations were not used between 2000 and 
2005, there are small discontinuities in the forcing at the beginning of the extension runs in 2006. 

3.10. Total Forcing and Comparison Between CMIP3 and CMIP5 

The total instantaneous tropopause forcing for the E2-R NINT ensemble (Figure 2b) is dominated by the 
effect of increasing greenhouse gas concentrations, offset by aerosol forcing with intermittent contributions 
by volcanic aerosols following eruptions. Figure 4d shows that the effect of the combined forcing agents is 
nearly everywhere positive. There is negative forcing only in a few limited regions, like eastern Asia as a 
result of tropospheric aerosols, and Indonesia due to land use. 

Table 6 compares the E2-R NINT forcing to that calculated with the previous (CMIP3) version of ModelE 
[. Hansen et al., 2007, Table 1], The comparison is based upon the difference in the instantaneous tropopause 
values between 2003 and 1 880. [While the forcing is computed as the difference over the same period for 
both models, the year 1850 represents the unperturbed climate for the calculation of the E2-R NINT forcing, 
compared to the year 1880 used by Hansen et al., 2007.] Greenhouse forcing is 15% larger in the E2-R NINT 
ensemble, as described previously (Figure 3). The negative offset by the aerosol forcing in the E2-R NINT 
ensemble is smaller in magnitude compared to the CMIP3 value, due to both direct and indirect effects. 
Solar forcing is smaller in the E2-R NINT ensemble. The sum of the forcings for the E2-R NINT ensemble is 
larger than the CMIP3 ModelE-R sum by about 0.6 WrrT 2 . 

Note that the forcing in Table 6 is calculated relative to its value in 1880, by which time there was already 
forcing due to rising greenhouse gas concentrations (Figure 2a). The present-day NINT forcing relative to 
the (1850) preindustrial climate is larger; 2.88 and 3.32 WirT 2 by 2000 and 2012, respectively (Figures 2b 
and 4d). 

Figure 2b shows the separate contributions to the sum of the forcings by natural and anthropogenic effects. 
The former combines forcing by solar and orbital variations, along with volcanic aerosols, while the latter is 
the sum over the remainder of the individual forcings. Natural forcing is dominated by episodic increases in 
the volcanic aerosol concentration following major tropical eruptions, with a residual trend from solar irradi- 
ance that is positive but small over the historical period. In contrast, the pronounced upward trend of the total 
forcing is almost perfectly matched by the slow but persistent increase of the anthropogenic component. 

Computation of the total instantaneous forcing for each ensemble is expensive. As an alternative, the ERF 
calculated according to Gregory et al. [2004] is shown in Figure 6. The relation between surface air tempera- 
ture and the TOA radiative imbalance is calculated using regression within the first 150 years of an experi- 
ment after the concentration of C0 2 is abruptly quadrupled. This regression coefficient is then multiplied by 
temperature variations over the historical period and combined with the radiative imbalance to derive the 
ERF [ Forster and Taylor 2006], (We do not attempt any correction resulting from rapid adjustment of the 
land temperature.) For comparison, the instantaneous tropopause forcing (denoted by "iRF") from the E2-R 
NINT ensemble is shown. The E2-R NINT ERF exceeds the iRF early in the historical period, but is up to 0.3 
WirT 2 smaller by the most recent decade. This may be the result of rapid adjustments to the forcing, or 
nonlinearities in the relation between temperature and the TOA imbalance that we do not account for. 
Among the historical experiments, the two TCAD ensembles have the lowest ERF. The deficit increases with 
time and is as large as 0.5 WirT 2 by the present day. 

Forster et al. [201 3] derived the ERF for several CMIP5 models during the historical period. The GISS models had 
the largest forcing when perturbed only by greenhouse gases, and were in the top guartile of total forcing. 

4. Results 

4.1. Surface Air Temperature 

Globally averaged surface air temperature during the historical period is shown in Figure 7 for all forcings 
along with anomalies from separate simulations perturbed by either anthropogenic or natural forcings. 
(Table 2 presents a complete list of the historical experiments.) Each model is represented by its ensemble 
mean. For comparison, the GISTEMP analysis of global surface air temperature is shown [ Hansen et al., 
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Figure 6. (a) Effective radiative forcing at TOA (Wm 2 ) relative to 1 850, and calculated according to Forster and Taylor [2006]. The black 
line shows instantaneous radiative forcing at the tropopause ("iRF") for the E2-R NINT ensemble, (b) excerpts the ERF since 1970. 


2006]. Other observational analyses document similar warming [e.g., Jones et at., 201 2; Rohde et at., 201 3]. 
The uncertainty of the GISTEMP global temperature anomaly (corresponding to twice the standard error) is 
±0.05 K in recent decades and twice that in the earliest part of the record [Hansen et at., 2006]. This uncer- 
tainty includes the effect of measurement error and incomplete global coverage by the observational net- 
work. Morice et at. [2012] estimate roughly twice this uncertainty for the HadCRUT4 temperature analyses, 
but in either case the errors are small compared to the observed warming of roughly 1 K over the course of 
the twentieth century. 

The model anomalies are perturbations from the unforced preindustrial climate, and indicate the effect of 
forcing. The forced response in the GISTEMP analysis is harder to isolate, because the global record starts in 
the late-nineteenth century when the response to increases in greenhouse gas concentrations may have 
already begun. In Figure 7, we define the model and observational anomalies to have zero average between 
1880 and 1900, the earliest part of the observational record. 

The models warm during the twentieth century at a slightly greater rate than observed, with the exception 
of the TCAD simulations. The range of values spanned by individual members within the E2-R NINT ensem- 
ble is indicated by gray shading that corresponds to two standard deviations of the intraensemble anoma- 
lies. (The standard deviation is similar for other ensembles.) The E2-R ensembles and the GISTEMP analysis 
agree within two standard deviations until the last decade, when the NINT and TCADI simulations are 
roughly 0.1 K too warm compared to observations by 201 2 (Figure 7b). Warming is larger in the E2-FH simu- 
lations, which is associated with relatively small heat uptake by the deep ocean compared to the E2-R 
ensembles (section 4.4). 

The NINT and TCADI ensembles warm at similar rates during the twentieth century, even though the latter 
has a slightly larger climate sensitivity. This sensitivity contrast is offset by the greater magnitude of TCADI 
aerosol forcing, mainly due to aerosol effects upon clouds (compare Figures 5c and 5d). The forcing contrast 
is augmented by the smaller TCADI ozone forcing. The TCAD ensembles also have prognostic ozone, which 
contributes to their slower rate of warming compared to the NINT ensembles with which they share a low 
climate sensitivity relative to the TCADI model. Prognostic TCAD ozone would contribute to these ensem- 
bles' lower effective forcing in Figure 6. We do not have an explanation for the similarity of the NINT and 
TCADI forcings in this figure. Calculation of the ERF assumes that climate sensitivity, which for the figure 
was calculated with respect to quadrupling C0 2 , is the same for all forcing agents. This may not be valid 
[Hansen et at., 2005], and would cause inaccuracies in the derived ERF, where the forcing agents are varying 
in time and relative importance. 

Each ensemble warming trend is consistent with the observed warming of the surface between 1979 and 
2005 (Figure 8), as are the trends of many of the individual ensemble members. We chose the beginning of 
this period to coincide with the starting date typically used to calculate trends in the Microwave Sounding 
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Figure 7. Global surface air temperature anomalies (K) for the historical experiments relative to a smoothed version of the control 
run. Gray shading shows plus or minus two standard deviations for the five members of the E2-R NINT ensemble. For compari- 
son, the observed GISTEMP anomaly is plotted. All time series are plotted relative to their 1880-1900 average. The historical 
experiments are perturbed by (a and b) all, (c) only anthropogenic, and (d) only natural forcings. Figure 7b excerpts the response 
to all forcings since 1970. 

Unit temperature retrievals described in section 4.2. The end of the period marks the conclusion of the his- 
torical experiments, after which not all ensembles are available. Table 7 gives numerical values for the trend 
of the observations along with each ensemble mean over this period, showing agreement within the uncer- 
tainty due to interannual variations. 

In the last decade, the observed rate of warming has slowed, but the historical ensembles extended past 
2005 have generally maintained their upward trend of temperature (Figure 7b). Table 7 compares the 
observed and modeled trends over the period 1 979-2012 using only the extended ensembles. Only the E2- 
R NINT trend shows any overlap with the observed warming; the other two ensembles show excessive 
warming. This overestimate could result from excessive climate sensitivity, insufficient heat uptake by the 
deep ocean, or an overestimate of the total climate forcing. We have noted that the effective forcing by the 
NINT ensembles is among the larger values associated with the CMIP5 models [Forster et a!., 201 3]. More- 
over, recent observations suggest that the CMIP5 forcing protocols (defined prior to the conclusion of the 
extended simulations) are a slight overestimate [Schmidt et a!., 201 4b]. Part of the warming discrepancy 
may also be the temporary result of fluctuations in deep ocean heat uptake that are uncorrelated between 
the models and observations [Meehl et at., 2011], Alternatively, the model climate sensitivity may be too 
large due to unrealistic features that we have yet to identify. 

Experiments including anthropogenic forcing can reproduce the warming observed during the twentieth 
century (Figure 7c), consistent with previous generations of coupled models surveyed by the IPCC [e.g., 
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Hegerl et al., 2007], In contrast, models 
with only natural forcings like solar irra- 
diance and volcanic aerosols show no 
sustained warming trend (Figure 7d). 
The most discernible response to natu- 
ral forcing is the cooling that follows 
volcanic eruptions. This temporary 
cooling is apparent in the GISTEMP 
analysis, except following the Krakatau 
eruption in 1 883, when the absence of 
cooling may be a consequence of lim- 
ited ocean measurements. (Following 
Krakatau, land stations cool at a magni- 
tude consistent with our ensembles) 
[Hansen et at. [2007, Figure SI]. 

The latitudinal dependence of surface 
warming is shown in Figure 9. Anoma- 
lies are defined relative to their time 
average between 1961 and 1990, a 
period chosen because it is the earliest that the GISTEMP analyses are available at all latitudes, including 
those containing Antarctica. Because the average temperature of each time series is constrained to be iden- 
tical late in the record, contrasting rates of warming between the ensembles are most apparent in the earlier 
decades. Volcanic eruptions are marked in the figure by green dashed lines, and are generally followed by 
cooling as seen in the global average temperature of Figure 7. (Table 8 identifies the eruptions marked in 
this and subsequent figures.) For this and subsequent plots of the temporal evolution of zonal means, dots 
mark locations where the anomaly (or difference) is outside the 95% confidence interval that results from 
intraensemble variability. For comparison between the model and observations, this interval is proportional 
to the intraensemble standard deviation; dots indicate where the observations are significantly outside the 
range of ensemble members and thus inconsistent with the model. For comparison between model ensem- 
ble means, this interval is proportional to the intraensemble standard error, and dots indicate where the 
ensembles are statistically distinct. 

Warming within the E2-R NINT ensemble is largest over the Arctic (Figure 9a). This "Arctic amplification" is 
robust among models [Holland and Bitz, 2003], and partly the result of the surface-albedo feedback, 

although other feedbacks involving 
clouds along with atmosphere and 
ocean heat transport contribute [Kay 
et al., 201 2]. Warming is only slightly 
enhanced at high SH latitudes, where 
the Antarctica surface albedo is buf- 
fered by a deep layer of snow and ice, 
and there is negative forcing by strato- 
spheric ozone loss (Figure 4a) along 
with substantial export of heat to the 
deep ocean. 

Comparison of model warming to the 
observed trend is obscured by interde- 
cadal variability. The largest discrep- 
ancy occurs prior to the midtwentieth 
century when temporary warming 
observed at NFH high latitudes is not 
simulated by the E2-R NINT ensemble 
(Figures 9b and 9c). Individual mem- 
bers of the ensemble show warming 


Table 7. Modeled and Observed Trends in Surface, Middle Troposphere, and 
Lower Stratospheric Global Average Temperature (K per decade) for all Histori- 
cal Ensembles (1979-2005) and Only the Extended Ensembles (1 979-201 2) a 



Surface 

Middle Troposphere 

Lower Stratosphere 



1979-2005 



Obs 

0.1 92 ±0.050 

0.106 ±0.078 

-0.367 

±0.159 

E2-R NINT 

0.192 ±0.057 

0.1 53 ±0.060 

-0.354 

±0.198 

E2-R TCAD 

0.188 ±0.065 

0.148 ±0.077 

-0.368 

±0.211 

E2-R TCADI 

0.224 ± 0.072 

0.1 78 ±0.091 

-0.371 

± 0.209 

E2-H NINT 

0.215 ±0.057 

0.1 55 ±0.058 

-0.362 

± 0.201 

E2-H TCAD 

0.186 ±0.061 

0.128 ±0.073 

-0.381 

± 0.207 

E2-H TCADI 

0.227 ± 0.066 

0.1 70 ±0.079 

-0.359 

±0.217 



1979-2012 



Obs 

0.168 ±0.033 

0.080 ± 0.053 

-0.309 

±0.101 

E2-R NINT 

0.191 ±0.036 

0.168 ±0.038 

-0.303 

±0.123 

E2-R TCADI 

0.234 ± 0.045 

0.21 7 ±0.058 

-0.300 

±0.133 

E2-H NINT 

0.232 ± 0.036 

0.1 90 ±0.039 

-0.307 

±0.125 


a Model trends represent the ensemble mean. Observed trends are calcu- 
lated from GISTEMP (surface) and MSU channels TMT (middle troposphere) 
and TLS (lower stratosphere). Also shown is the uncertainty represented by 
two standard deviations of the trend estimate (related to interannual variabili- 
ty). Different methods of MSU TMT retrieval contribute additional trend uncer- 
tainty near 0.05 K per decade [cf. Thome et al., 2011, Figure 10]. 
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Figure 8. Change in modeled and observed global average temperature between 
1979 and 2005 at the surface, middle troposphere, and lower stratosphere. The 
observed values are from GISTEMP (surface), and MSU channels TMT (middle tro- 
posphere) and TLS (lower stratosphere). The observed value is in purple with its 
uncertainty in black (corresponding to plus or minus two standard deviations of 
interannual variations). Ensemble average change is shown as triangles (NINT), 
diamonds (TCAD), and circles (TCADI), with change for individual ensemble mem- 
bers plotted as small triangles. Change is expressed per decade. 
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Figure 9. Zonal average of surface air temperature anomalies (K) for the model ensemble means and the GISTEMP analyses. The average anomaly at each latitude is defined to be zero 
between 1961 and 1 990. Dots mark locations where the anomaly (or difference) is outside the 95% confidence interval that results from intraensemble variability. The dates of major vol- 
canic eruptions (listed in Table 8) are marked by green dashed lines. 


closer to the observed value, but the intraensemble standard deviation is around 0.5 K, less than the 1 -3 K 
discrepancy of the ensemble mean with respect to observations during this period. 


Zonal-mean warming is similar in the E2-R NINT, and TCADI ensembles (Figure 9d), with the largest temper- 
ature differences associated with uncorrelated interdecadal variability at high latitudes. The similar twenti- 
eth century trends at each latitude show the offsetting effect of climate sensitivity and tropospheric aerosol 


forcing, even though the latter is confined to specific latitudes. The E2-R TCAD and TCADI ensembles show 

similar temperature trends, except within the 


Table 8. Major Volcanic Eruptions Marked in Figures 3 


Name 

Location 

Date 

Krakatoa 

Indonesia 6°S 

Sep 1883 

Santa Maria 

Guatemala 15°N 

Oct 1902 

Agung 

Bali 8°S 

Jun 1963 

El Chichon 

Mexico 17°N 

Apr 1982 

Pinatubo 

Philippines 15°N 

Jul 1991 


'The date indicates the month that zonal-average column optical 
thickness first exceeds 0.1 according to Sato et al. [1 993]. 


NH midlatitudes where the latter ensemble 
warms by roughly an extra 0.5 K over the entire 
historical period (Figure 9e). (This trend contrast 
is statistically distinct from zero.) This extra 
warming is contrary to expectation. The two 
ensembles are distinguished by their different 
treatment of the indirect effect, and the magni- 
tude of the TCADI AIE is roughly twice the 
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Figure 1 0. Contrast of decadal averages (1996-2005 minus 1 880-1889) of surface air temperature (K). The number at the lower right of each panel is the global average (subject to data 
availability). Dots mark locations where the anomaly (or difference) is outside the 95% confidence interval that results from intraensemble variability. 

TCAD value in the zonal average. (The TCAD AIE has a nearly identical regional distribution and global aver- 
age compared to the NINT ensemble in Figure 5c.) This forcing contrast would ostensibly result in a weaker 
warming trend within the TCADI ensemble as aerosol concentrations increase throughout the twentieth 
century, contrary to the model behavior (Figure 9e). 

The regional pattern of warming over the historical period is shown in Figure 1 0, which contrasts tempera- 
ture during a recent decade (1996-2005) with the first decade of the GISTEMP analyses (1880-1889). For 
this and subsequent plots of regional trends, dots mark locations where the anomaly (or difference) is statis- 
tically distinct from zero, as in Figure 9. The E2-R NINT ensemble exhibits enhanced warming over high lati- 
tudes as described above, but warming is also enhanced over the continents compared to the ocean 
(Figure 1 0a), as expected from the small heat capacity of the land surface and its limited ability to accom- 
modate changes in the incident radiative flux with evaporation. Differences between the E2-R NINT ensem- 
ble and the GISTEMP analysis are generally largest over elevated regions like the Tibetan Plateau, the North 
American Rockies, and Greenland, where the model shows less warming than observed (Figure 10c). In con- 
trast, the model warming is excessive over the North Atlantic and Norwegian Sea. 

The Tropics in the E2-R NINT ensemble warm slightly faster than observed (Figure 9c). The zonal contrast in 
model temperature across the equatorial Pacific is reduced, reminiscent of changes during an El Nino event 
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Figure 1 1. (a) Middle Troposphere and (b) Lower Stratosphere brightness temperature (K) from the Microwave Sounding Unit. Global and 
annual average anomalies are defined relative to the period 1979-2005. Gray shading shows variations among members within the E2-R 
NINT ensemble (plus or minus two standard deviations). 


(Figure 1 0a). In contrast, GISTEMP shows an increase in the zonal contrast and a minimum in warming 
within the Central Pacific resembling a La Nina pattern (Figure 10b). 

The greater globally averaged warming of the E2-H ensembles, compared to those of E2-R (Figure 7a), origi- 
nates within the NH high latitudes (Figure 9f). Ocean heat content trends in the E2-FH ensembles (section 
4.4.) suggest that the excessive surface warming represents accumulated heat that is slow to be exported 
into the deep ocean. This interpretation is consistent with the relatively warm Arctic and Norwegian Sea in 
the E2-FH TCADI ensemble, as well as the relative cooling in the vicinity of the Labrador Sea (Figure 1 0f). The 
latter is a region of Atlantic deepwater formation, where reduced heat export to the deep ocean is associ- 
ated with decreasing heat convergence. 


4.2. Upper Air Temperature 

Greenhouse warming has a characteristic vertical structure with tropospheric warming accompanied by 
stratospheric cooling [Manabe and Weatherald, 1967], We compare model warming above the surface to 
brightness temperatures retrieved from the Microwave Sounding Unit and Advanced Microwave Sounding 
Unit by Remote Sensing Systems (RSS) [ Mears and Wentz, 2009, Version 3.3]. Discrepancies between trends 
derived from different upper air data sets are larger than differences of estimated trends of surface air tem- 
perature [Thome et ai, 201 1], The RSS trends are intermediate compared to two other MSU data sets. We 
use two retrieved variables: TLS ("Temperature of the Lower Stratosphere"), that is influenced by radiation 
originating from a layer centered near 20 km, and TMT ('Temperature of the Middle Troposphere"), whose 
layer of influence is centered near 10 km but extends into the lower stratosphere. Although TLS and TMT 
nominally distinguish between temperature of the stratosphere and troposphere, both retrievals are influ- 
enced by radiation over a broader depth and depend upon temperature at common levels. 

We construct a model brightness temperature using vertical weights that are globally uniform and invariant 
with time, neglecting variations of surface emissivity and the vertical distribution of microwave absorbers, 
among other properties [ Hansen et ai, 1 998], The effect of this approximation has been calculated by Santer 
et al. [1 999], who conclude that the error introduced is no more than a quarter of the global, seasonal trend 
of the midtropospheric TMT retrieval, and far less for the lower stratospheric TLS retrieval. 

Global averages of retrieved and modeled brightness temperatures for the lower stratosphere and middle 
troposphere are shown in Figure 1 1, which shows the expected signature of increasing greenhouse gas 
concentrations with a warming troposphere accompanied by cooling within the stratosphere. The models 
show stratospheric cooling that closely matches the observed trend of -0.37 ± 0.16 K per decade between 
1 979 and 2005 (Table 7), with all ensemble means along with the individual members well within the 
observed trend uncertainty related to interannual variability (Figure 8). This variability primarily reflects the 
large but temporary response following the El Chichon and Pinatubo eruptions that injected volcanic aero- 
sols above the tropopause, increasing the absorption of solar radiation and interrupting the cooling of the 
lower stratosphere (Figure 1 la). The observed increase of the middle troposphere brightness temperature 
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Figure 12. Zonal average of lower stratosphere brightness temperature (K).The average anomaly at each latitude is defined to be zero between 1850 and 1880. Dots mark locations 
where the anomaly (or difference) is outside the 95% confidence interval that results from intraensemble variability. The dates of major volcanic eruptions (listed in Table 8) are marked 
by green dashed lines. 


of 0.1 06 ± 0.078 K per decade is smaller than warming at the surface (Figure 8 and Table 7), at least in part 
because the TMT retrieval includes a contribution from the cooling stratosphere [Mears and Wentz, 2009], 
When the stratospheric contribution is removed, the surface and middle-tropospheric values are compara- 
ble [Fi/ et at., 2004]. 

Model warming of the midtroposphere is slightly larger than the retrieved value between 1979 and 2005 
(Figure 1 1 b), although each ensemble-mean trend is within the observed uncertainty that results from inter- 
annual variations, as are the trends of most ensemble members (Figure 8). Different methods of MSU TMT 
retrieval contribute additional uncertainty of roughly 0.05 K per decade to the observed trend [cf. Thome 
et at., 201 1, Figure 10]. The NINT and TCADI ensembles show the most rapid warming, consistent with their 
trends at the surface (Figure 7). The warming observed within the middle troposphere has slowed in recent 
years, coincident with the surface trend, and for the longer period of 1979-2012, the extended ensembles 
overestimate the MSU trend. Unusually warm years in the MSU TMT record like 1998 and 2010 follow El 
Nino events [cf. Yulaeva and Wallace, 1994]. While the models exhibit interannual variability in sea surface 
temperature in the equatorial East Pacific that resembles El Nino [Schmidt et at., 201 4a], these events are pri- 
marily unforced and thus temporally uncorrelated with observed events. Simulations with prescribed SST 
would allow a more detailed comparison of the model to observed temporal variations of upper air 
temperature. 

Century-scale variations in lower stratospheric temperature are shown as zonal averages of the model 
ensemble means in Figure 1 2. The E2-R NINT ensemble shows stratospheric cooling that begins by the mid- 
twentieth century and is largest at high latitudes, especially within the SH (Figure 12a). The exceptional 
cooling over Antarctica represents the combined effect of increasing greenhouse gas concentrations and 
stratospheric ozone depletion [Shine et al., 2003]. The E2-R TCADI ensemble shows recent tropical cooling 
that is slightly larger compared to the corresponding NINT ensemble (Figure 1 2b), consistent with greater 
ozone depletion in the former [Shindell et al., 201 3b]. The TCADI ensemble also shows additional cooling 
over Antarctica following the Pinatubo eruption (Figure 1 2b), as volcanic aerosols increase the concentra- 
tion of ozone-depleting substances, temporarily amplifying the ozone hole and cooling the lower 
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Figure 13. As in Figure 12 but for middle troposphere brightness temperature. 


stratosphere. (Post-eruption reductions in ozone are not prescribed in the NINT ensembles.) Differences in 
stratospheric temperature trends among the other ensembles are small compared to interdecadal varia- 
tions (Figures 1 2c and 1 2d). 

The corresponding zonal-average trends for the midtroposphere brightness temperature are shown in Fig- 
ure 1 3. At all latitudes, this brightness temperature represents the competing influence of proximal warm- 
ing at the surface with less strongly weighted but larger stratospheric cooling (Table 7). The E2-R NINT 
ensemble shows warming at most latitudes that is temporarily interrupted following major volcanic erup- 
tions (Figure 13a). The exception to warming is at high SFI latitudes, where decadal trends of ozone deple- 
tion augment stratospheric cooling. The E2-R TCADI ensemble shows even greater SFi cooling (Figure 1 3b), 
consistent with its TLS trend (Figure 12b). Brightness trends resulting from the treatment of the indirect 
effect and ocean resemble differences in the surface air temperature (Figures 9c, 9d, 13c, and 13d). 

The regional distribution of trends in lower stratospheric brightness temperature is shown in Figure 14. 
Trends are defined by linear regression at each location between 1979 and 2005. In general, stratospheric 
cooling within the E2-R NINT ensemble is more zonally symmetric than the MSU retrieval (Figures 14b and 
1 4c). The observed zonal asymmetry exceeds two standard deviations of the individual ensemble members, 
so its contrast with the model is not simply the result of ensemble averaging. Differences among the 
ensembles due to their treatment of atmospheric composition, the indirect effect, and ocean are smaller. 
(Figures 14d-14f). 

Corresponding regional trends in middle troposphere brightness temperature are shown in Figure 15. The 
E2-R NINT ensemble generally reproduces the observed contrast between Antarctic cooling and warming at 
all other latitudes. Flowever, the NINT ensemble overestimates the observed tropical warming, especially in 
the Central Pacific, as at the surface (Figure 1 0c), while underestimating the NFH high latitude warming, 
especially over the Norwegian Sea and Arctic (Figure 15c). Differences between the ensembles are smaller 
and seem to be unsystematic, exceeding intraensemble variations in only a few locations. 
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Figure 14. Change in lower stratosphere brightness temperatures {K per decade) between 1 979 and 2005. The number at the lower right of each panel is the global average (subject to 
data availability). Dots mark locations where the anomaly (or difference) is outside the 95% confidence interval that results from intraensemble variability. 


4.3. Sea Ice 

The annual-average sea-ice area is shown in Figure 16 as a percentage of the total hemispheric area. For 
comparison, sea-ice area distributed by the National Snow and Ice Data Center (NSIDC) is plotted [Fetterer 
et al., 201 1], including a contribution from the unobserved northernmost latitudes of the Arctic Ocean (due 
to the satellites' inclined orbit) that assumes this region is always ice-covered. 

The E2-R ensembles show excellent agreement with the Northern Flemisphere (NFH) NSIDC annual average 
(Figure 1 6a). This performance is a clear improvement over the CMIP3 ModelE-R that overestimated the 
observed NFH annual value [ Schmidt et al., 2006; Hansen et al., 2007]. The agreement in Figure 16a results 
partly from compensating errors with respect to the observed seasonal cycle. All ModelE2 ensembles have 
too little sea-ice area at the September annual minimum, while the E2-R ensembles have too much at the 
March maximum [ Schmidt et al., 201 4a]. The E2-FH ensembles underestimate the NSIDC NFH annual value, 
consistent with their warmer temperatures at high latitudes (Figure 1 0f). 

Despite biases in the seasonal cycle, almost all ModelE2 ensembles reproduce the observed rate of decrease 
of NFH ice area, both in the annual average and at the annual minimum and maximum (Figure 1 7). The 
exception is the E2-R TCAD ensemble that exhibits slow trends of both melting and warming in the late- 
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Figure 15. As in Figure 14 but for middle troposphere brightness temperature. 


twentieth century (Figure 7). The E2-H ensembles reproduce the observed rate of melting, but due to insuf- 
ficient present-day ice cover lead to estimates of an ice-free Arctic that are among the earliest of the CMIP5 
models [Sfroeve et a!., 201 2]. 

All ensembles underestimate Antarctic sea-ice area, simulating about half the retrieved annual average, 
while failing to reproduce the slight upward trend in the observations (Figure 16b). In this region, both 
ocean models exhibit excessive vertical mixing of warm water to the surface, an error that is exacerbated 
during the summer by insufficient low clouds and excessive surface shortwave absorption [ Schmidt et al., 
2014a], 


4.4. Ocean Heat Content 

The total forcing in Figure 2b represents an inferred energy imbalance at TOA that must be compensated at 
equilibrium, for example, by an increase in longwave radiation to space. Outgoing longwave radiation (OLR) 
increases with the atmospheric temperature, which at equilibrium is tightly coupled to the temperature of 
the upper ocean. This approach to equilibrium takes at least several centuries because the deep ocean 
removes heat from the upper ocean, slowing the rate at which the atmosphere warms and OLR increases. 
The incomplete compensation of the forcing is shown in Figure 18, which shows the net radiative flux at 
TOA for each ensemble along with the instantaneous forcing from Figure 2b. The net flux is nearly identical 
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Figure 16. Model sea-ice area compared to passive microwave retrievals from the National Snow & Ice Data Center for the (a) Northern 
Hemisphere and (b) Southern Hemisphere. Plotted area is an annual average and hemispheric sum expressed as a percentage of the total 
hemispheric area. 


for all the ensembles, suggesting similar OLR, despite different rates of surface warming. This similarity 
could result from compensation between the lapse-rate and water vapor feedbacks that is a consistent fea- 
ture of climate models [ Soden and Held 2006; Held and Shell 201 2]. 

Warming of the ocean is indicated by an increase in the ocean heat content, shown for the historical period 
in Figure 19a. This heat content is a sum over all depths, and increases gradually with the forcing for all 
ensembles, despite temporary interruptions following major volcanic eruptions. Shading shows the range 
of variability among the members of the E2-R NINT ensemble, measured here by plus or minus twice the 
standard deviation of the intraensemble variations. The heat content of the E2-H ensembles fall below this 
range, indicating that ocean heat uptake is significantly smaller in these ensembles. 

While the entire ocean column in the E2-R ensembles takes up more heat compared to the E2-H ensembles, 
the latter store more heat in the upper ocean (Figure 1 9a), the layer extending to depths of 776 and 700 m 
in the E2-R and E2-FH ensembles, respectively. This depth is roughly the extent of the gyre circulations, and 
heat penetrates below this level mainly in regions of deepwater formation by ocean deep convection. Thus, 
the E2-FI TCADI ensemble accumulates more heat in the upper ocean compared to the corresponding E2-R 
model, despite taking up less heat when summed over all depths. This is consistent with the more rapid 
warming at the surface in the E2-FI TCADI ensemble (Figure 7a). It is also consistent with the reduced rate of 
warming in the Labrador Sea in this ensemble compared to E2-R (Figure 1 0f) that indicates a reduction of 
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Figure 17. Change in NH sea-ice area (per decade and as a percentage of total hemispheric area) between 1979 and 2005. The observed 
value from the NSIDC is in purple with its uncertainty in black (corresponding to two standard deviations of interannual variability). Ensem- 
ble average change is shown as triangles (NINT), diamonds (TCAD), and circles (TCADI), with change for individual ensemble members 
plotted as small triangles. 
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heat convergence associated with sinking 
of surface water into the deep ocean. In 
contrast, the E2-R ensembles transfer heat 
into the deep ocean to a greater extent, 
slowing the warming at the surface. 

Observations of the global heat content of 
the upper ocean extend back to the mid- 
twentieth century. Figure 19b shows the 
heat content of the upper 700 m, accord- 
ing to analysis of subsurface temperature 
measurements by Domingues et al. [2008], 
extended to the present by Church et al. 
[201 1 ]. The uncertainty, indicated by pur- 
ple shading, is largest early in the record, 
and arises from a combination of measure- 
ment error and limited spatial coverage, 
especially within the Southern Ocean. 
Other global estimates of heat content 
within the upper ocean are based upon different treatments of the measurement limitations, but yield trends 
that are consistent, given the uncertainty [e.g., Levitus et al., 2005]. During the past decade, global coverage of 
the ocean has been achieved with more accurate profilers, resulting in more confident estimates of the 
observed global trend. Figure 19b shows the estimate during the past two decades by Lyman et al. [2010], 
with gray shading denoting error bars that are small compared to estimates during previous decades. The 
warming associated with the E2-R ensembles is in agreement with the trend inferred by both observational 
analyses. In contrast, the E2-H ensembles warm excessively, possibly because of insufficient heat export to the 
deep ocean below 700 m. 

The reduction of heat export to the deep ocean when an isopycnal OGCM is replaced with a model based 
upon height coordinates is consistent with behavior exhibited by the Geophysical Fluid Dynamics Labora- 
tory coupled model [Hallberg et al., 2012]. Measuring heat export and the sensitivity of model export to sub- 
grid mixing remains an important challenge. 


Net TOA Flux 



Figure 1 8. Net radiative flux at TOA (Wm~ 2 ) for the ensemble averages. The 
black line shows instantaneous radiative forcing at the tropopause ("iRF") for 
the E2-R NINT ensemble. Ensemble values are global and annual averages. 
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Figure 19. (a) Increase in ocean heat content since 1850 in the NINT and TCADI ensembles. Red and blue lines indicate heat content 
summed over all depths; light red and blue lines indicate heat within the upper 776 and 700 m of the ocean for the E2-R and E2-H ensem- 
bles, respectively. Pink and light pink shading shows the intraensemble uncertainty in the E2-R NINT ensemble (equal to plus or minus two 
standard deviations) for the total and upper ocean heat content, (b) Trends in upper ocean heat content compared to measurements in 
the upper 700 m analyzed by Church et al. [201 1 ] and Lyman et al. [201 0], whose uncertainties are denoted by purple and gray shading, 
respectively. Observed anomalies are adjusted so that their time average matches the E2-R NINT average over the same period. Vertical 
lines mark major eruptions of tropical volcanoes. 
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Figure 20. (a) Global thermosteric sea-level rise (m) since the preindustrial in the E2-R ensembles, along with the observational estimate 
from density changes in the upper 700 m by Church et al. [2011]. The observations are offset to have the same time-average as the NINT 
ensemble over their common period, so that trends can be more easily compared. Pink shading shows variations among the NINT ensem- 
ble members (plus or minus two standard deviations), while the gray shading represents twice the standard error of the observational esti- 
mate of Church et al. [2011] based upon a combination of tide gauges and satellite altimetry retrievals. Ensemble-mean trends are 
computed between 1900 and 1990. Vertical lines mark major eruptions of tropical volcanoes, (b) excerpt since 1950. 


4.5. Sea Level 

Warming during the historical period is associated with an expansion of ocean volume. Density changes 
(including the effect of salinity anomalies) contribute a little under half of the observed rise in sea level dur- 
ing the past four decades [Church et al., 201 1]. Melting of glaciers and the continental ice sheets of Green- 
land and Antarctica supply most of the remainder. The observed rise in sea level is slightly reduced by an 
increase in terrestrial water storage by river dams offset by groundwater depletion. The model does not 
simulate the dynamical disintegration of land ice and has unphysical trends in lake storage, so we present 
in Figure 20 only the density-related or thermosteric changes in E2-R sea level. (Values for the E2-H model 
are unavailable due to a diagnostic error.) For comparison, the observed thermosteric sea-level rise limited 
to temperature and salinity changes above 700 m is also plotted [Church et al., 2011], Density changes 
below this depth are omitted due to infrequent measurements prior to the last decade. (To facilitate com- 
parison of trends, the observations are given the same time-averaged value as the E2-R NINT ensemble 
over their common period.) 

Figure 20 shows that thermosteric sea level is reduced in both the model and observations following major 
volcanic eruptions. The model trend has increased in the past two decades (Figure 20b), consistent with 
measurements of total sea level by both tide gauges and satellite altimeters [Church et al., 201 1; Nerem 
et al., 2010], Thermosteric trends in each ensemble after 1 990 are three times the trend earlier in the cen- 
tury. Between 1993 and 2009, the modeled thermosteric contribution to sea level increases by just over 1 
mm per year in each ensemble: twice the observed trend of 0.65 ± 0.1 2 mm per year. The omission of the 
observed thermosteric change below 700 m makes an unknown contribution to this discrepancy. Excessive 
warming during the past decade by the NINT and TCADI ensembles also contributes (Figure 7b). 

Since the preindustrial, the total sea level rise is estimated near 210 mm [Church and White, 201 1]. The ther- 
mosteric change of the NINT and TCADI ensembles since 1850 is 65 ±15 mm, about one-third of the total 
observed value. 


4.6. Precipitation 

Climatological precipitation varies over a smaller spatial scale than temperature, making precipitation 
anomalies more difficult to detect with present-day in situ measurements. Satellites offer extensive spatial 
coverage (at least within the Tropics) but retrieving rain rates from measured radiances is not straightfor- 
ward. Evaluation of climatological precipitation in each ensemble during the satellite era is described by 
Schmidt et al. [2014a]. 

Precipitation changes are related to the tropospheric radiative divergence anomaly [Allen and Ingram, 
2002], The warming associated with rising greenhouse gas concentrations, for example, increases this 
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divergence, requiring greater latent heating 
(and therefore precipitation) to maintain 
energy balance. In contrast, many aerosol 
species reduce this divergence, and thus 
precipitation. 

Global precipitation increases over the his- 
torical period in each ensemble (Figure 21), 
coincident with the increasing temperature. 
The total increase since the preindustrial is 
on the order of 0.02 mm day -1 compared 
to the observed global annual mean near 3 
mm day -1 [Stephens etal., 2012]. The larg- 
est absolute increase of precipitation by the 
end of the historical period is associated 
with the E2-FH NINT ensemble (Figure 21), 
whose warming is also largest (Figure 7a 
and 1 1 b). 

For each ensemble, global precipitation 
changes during the twentieth century by 
roughly 0.6% (E2-R) or 0.7% (E2-H) per 
degree K of warming. Precise estimation of the fractional increase of global precipitation is difficult because 
variability is large compared to trends over the historical period. Global precipitation decreases following 
Pinatubo [Trenberth and Dai 2007]. The model decrease following this eruption is over twice as large as the 
simulated increase over the entire twentieth century, and comparable reductions occur following other 
major eruptions (Figure 21). Comparison of the model fractional increase over the twentieth century to 
changes in other models compiled by Held and Soden [2006, Figure 2d] suggests that the ModelE2 sensitiv- 
ity during this period is on the low side, although a few of these other models show a decrease in 
twentieth-century precipitation. 

Precipitation sensitivity during the historical period is influenced by aerosol forcing that varies among mod- 
els to a greater extent than forcing by long-lived greenhouse gases [e.g., Shindell et at., 201 3a]. As a more 
robust comparison of sensitivity with other models, we calculate the fractional change of precipitation in an 
idealized CMIP5 experiment where aerosol forcing is absent and the climate is perturbed solely by a 1% per 
year increase of the C0 2 concentration until quadrupling. Over the initial 1 50 years of this simulation, the 
fractional precipitation increase is 1.3% (E2-R) or 1.4% (E2-FI) per degree K of warming. This is virtually iden- 
tical to the mean increase calculated among the CMIP5 models by Pendergrass and Hartmann [2014]. For 
comparison, the fractional increase for the CMIP3 counterpart to the E2-R NINT model is over twice as large 
[Lambert and Webb, 2008], for reasons that bear further investigation. The higher precipitation increase dur- 
ing the experiment forced only by increasing C0 2 compared to the historical experiments illustrates the 
ability of aerosols to reduce precipitation. 

Zonal-average changes in precipitation during the historical period are shown in Figure 22. A 5 year low- 
pass filter is applied to remove residual El Nino variability that is not completely damped by averaging over 
ensemble members. Without filtering, this variability obscures multidecadal trends along with differences 
between the ensembles. In the E2-R NINT ensemble, precipitation increases in the deep Tropics, while 
decreasing in the subtropics (Figure 22a). This spatial pattern is consistent with trends calculated from 
Global Precipitation Climatology Project retrievals [Allan et al., 2010], and theoretical expectations of precipi- 
tation sensitivity to increasing temperature and net radiation into the surface [Held and Soden, 2006]. The 
precipitation increase is also consistent with the "upped-ante" mechanism where tropical precipitation 
intensifies and contracts areally where it is already large. (This is because latent heat release increases the 
upper tropospheric temperature, increasing the static stability throughout the entire Tropics, including the 
edge of the convecting region that originally was only marginally unstable) [Chou and Neelin [2004], The 
ensemble also shows increased precipitation within the Northern Flemisphere extratropical storm track as 
well as near the Antarctic coast (Figure 22a). Differences between the models are relatively small compared 
to the E2-R NINT trend (Figures 22b-22d). 


Global Precipitation 



Figure 21. Global precipitation anomaly (in tenths of a mm day -1 ). Shading 
represents variations among ensemble members within the E2-R NINT 
ensemble (plus or minus two standard deviations). Major volcanic eruptions 
are marked with vertical lines. 
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Figure 22. Zonal average precipitation anomaly compared to the control experiment (in tenths of a mm day -1 ). The average anomaly at each latitude is defined to be zero between 
1850 and 1880. The dates of major volcanic eruptions (listed in Table 8) are marked by green dashed lines. To remove uncorrelated El Nino variability between the simulations that is not 
completely damped by ensemble averaging, a 5 year low-pass filter is applied. Dots mark locations where the anomaly (or difference) is outside the 95% confidence interval that results 
from intraensemble variability. 




Figure 23. Contrast of decadal averages of precipitation (1 996-2005 minus 1 880-1 889 in mm/day). The number at the lower right of each panel is the global average. Dots mark loca- 
tions where the anomaly (or difference) is outside the 95% confidence interval that results from intraensemble variability. 
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RCP8.5: NH Annular Mode (Oct-Mar) b) RCP8.5: SH Annular Mode (Nov-Feb) 



1900 2000 2100 2200 2300 



Figure 24. Leading principal component (PC) of sea-level pressure (hPa) during the historical period and RCP8.5 scenario. The PC repre- 
sents the pressure anomaly corresponding to the annular mode, averaged poleward of 60° for (a) the NH (October-March) and (b) the SH 
(November-February). The colored lines represent the PC of each ensemble after filtering to remove subdecadal frequencies. To allow 
comparison of long-term trends with interannual variations, the gray line shows the unfiltered seasonal average PC for the E2-R NINT 
ensemble. The black line shows the corresponding NCEP PC between 1948 and 2012, with only periods shorter than 5 years filtered to pre- 
serve PC values at the beginning and end. The model PCs are constructed to average zero over their duration. To facilitate comparison of 
trends, the observed PCs are offset to have the same time average as the E2-R NINT PCs during their period of overlap. 


Regional differences in decadal-mean rainfall between the late nineteenth century (1880-1889) and the 
end of the twentieth century (1 996-2005) are shown in Figure 23. In the NINT ensemble, the most promi- 
nent feature is the displacement of rainfall from the West Pacific and subtropics toward the Central Pacific 
(Figure 23a), consistent with the El Nino-like warming in Figure 10a. Despite this warming, the ensemble 
predicts drying over Mexico and the American Southwest, in contrast to the canonical El Nino precipitation 
anomaly in these regions [ Ropelewski and Halpert, 1 996; Seager et at., 2005], There is an increase over the 
interior of the Eurasian continent, eastern North America, and within the Atlantic and Pacific storm tracks. In 
contrast, precipitation is reduced over the Mediterranean and the Sahel. In the E2-FH TCADI ensemble, pre- 
cipitation increases over the entire equatorial Pacific, with a maximum west of the dateline (Figure 23d). 

4.7. Circulation Trends 

The climate anomalies described above are coupled to trends in the atmospheric and ocean circulation. 
Annular modes are the leading form of atmospheric variability in the extratropics of both hemispheres 
[Thompson and Wallace, 2000]. By convention, the positive phase of each mode represents an equatorward 
displacement of mass from high latitudes to the subtropics, with a poleward shift of the westerlies within 
the troposphere and at the surface. Previous studies have shown that both stratospheric ozone loss and 
increasing concentrations of greenhouse gases drive positive annular trends [e.g., Shindell and Schmidt, 
2004], although not all models simulate this response [Miller et at., 2006b], 

Figure 24 shows a positive annular trend within each hemisphere between the start of the historical period 
and the end of the twenty-third century, assuming the RCP8.5 scenario for future atmospheric composition, 
whose radiative forcing increases to approximately 8.5 and 12 WtrT 2 by 2100 and 2300, respectively [van 
Vuuren et al., 201 1]. (We include model output past the end of the historical period to distinguish forced 
trends from natural variability.) Ensembles based upon the RCP4.5 scenario with weaker anthropogenic 
forcing also show positive, albeit smaller, annular trends. Annular trends are represented by the first princi- 
pal component (PC) of sea-level pressure (SLP) in each hemisphere, rescaled to represent the average pres- 
sure anomaly poleward of 60° in hPa. The PC is calculated from monthly anomalies between October and 
March in the NH, and November through February in the SH for comparison to trends in the CMIP3 models 
calculated by Miller et al. [2006b]. The colored lines in Figure 24 show the PC for each ensemble after filter- 
ing subdecadal time scales. To allow comparison of the trend to interannual variations, the gray line shows 
unfiltered seasonal averages of the PC for the E2-R NINT ensemble. Both hemispheres show a positive trend, 
and by association, a poleward displacement of the westerlies. The trend is apparent first in the SH, by the 
end of the twentieth century, due to stratospheric ozone depletion that augments the effect of greenhouse 
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gases. The SH trend is seen first in the TCAD and TCADI ensembles, where prognostic ozone loss is slightly 
stronger than the observed depletion used to prescribe the NINT ozone concentration [Shindell et al., 
2013b]. Because of their excessive depletion, the TCAD and TCADI ensembles exhibit greater ozone recov- 
ery, consistent with their smaller PC in later centuries compared to the NINT ensemble. For comparison, the 
figure shows the annular PC calculated from National Center for Environmental Prediction (NCEP) SLP 
between 1 948 and 201 2 [ Kalnay et al., 1 996]. Because each time-averaged PC is originally zero by construc- 
tion, only the ensemble trends (and not the individual PC values) can be meaningfully compared between 
the reanalysis value. (To facilitate comparison of trends, we shifted the observed PC in Figure 24 so that its 
time-mean equals that of the E2-R NINT PC during the observing period.) In the SH, late twentieth-century 
trends in the ensembles are similar to the reanalyzed PC trend, although the comparison remains imprecise 
due to decadal variability. (Only periods shorter than 5 years are filtered from the NCEP PC to preserve its 
values at the beginning and end of the reanalysis period.) In the NH, ensemble trends are not distinguish- 
able from decadal variations until almost the end of the twenty-first century, consistent with the indistinct 
trend in the present-day reanalysis PC. 


5. Conclusions 

We have described the forcing and response in the NASA GISS ModelE2 CGCM for the CMIP5 historical 
period between 1850 and 2005, with extensions of some ensembles through 2012. There are three versions 
of the atmospheric model distinguished by their treatment of atmospheric constituents and the aerosol 
indirect effect. In the NINT model, atmospheric composition is specified based upon previous offline calcula- 
tions, while the indirect effect consists of an empirical relation between aerosol mass and low cloud cover 
whose global magnitude is tuned [Hansen et al., 2005]. In the TCADI model, ozone and aerosol concentra- 
tions are calculated interactively as a function of atmospheric chemistry and climate, while the concentra- 
tions of all long-lived greenhouse gases other than C0 2 are prescribed at the surface but calculated above 
[Shindell et al., 2013b]. The first indirect effect is implemented as a prognostic equation for the number of 
available CCN that balances supply by aerosols and removal by wet deposition. The TCAD model is a hybrid, 
sharing prognostic atmospheric constituents with TCADI but an empirical AIE with NINT. Each of the three 
versions of the ModelE2 AGCM is coupled to either the FIYCOM or the Russell OGCM, resulting in six ver- 
sions of the coupled ModelE2, each integrated as a five-member ensemble. 

The total forcing by the end of the historical period is in the upper quartile of CMIP5 models [Forster et al., 

201 3], Compared to the previous (CMIP3) generation of ModelE, year-2003 forcing in the E2-R NINT ensemble 
is higher by 0.6 Wm -2 . This difference is mainly attributable to larger forcing by long-lived greenhouse gases, 
along with reduced direct and indirect forcing by aerosols (cf. Table 6). Total forcing is smaller in the E2-R 
TCADI ensemble due to greater stratospheric ozone depletion and more negative aerosol forcing. 

The E2-R NINT ensemble reproduces the observed warming of the surface and troposphere along with 
stratospheric cooling until the end of the historical period in 2005 (Figure 8). Flowever, the ensemble contin- 
ues to warm at a steady rate without the slowdown observed during the last decade. The warming discrep- 
ancy is larger than unforced differences among the individual ensemble members, pointing to possible 
errors related to forcing or some aspect of the model that we have yet to identify. The E2-R NINT ensemble 
matches the observed upward trend of heat content in the upper 700 m of the ocean. (We are investigating 
whether this agreement remains after correcting an error in the subgrid mixing parameterization.) The 
ensemble reproduces the annual loss of NH sea ice observed during recent decades, along with observed 
trends of maximum and minimum area (Figure 1 7). Flowever, sea-ice area in the SH decreases in contrast to 
the small observed increase. Precipitation increases in the deep Tropics during the late twentieth century at 
the expense of the subtropics, associated with an eastward displacement of precipitation toward the equa- 
torial Central Pacific that resembles El Nino. 

Trends in the other ensembles with respect to NINT are influenced by their different climate sensitivity, forc- 
ing, and ocean heat uptake. Temperature and sea-ice trends are similar in the E2-R TCADI ensemble with its 
greater climate sensitivity offset by smaller ozone forcing and larger aerosol-cloud interactions. The TCAD 
ensembles warm the least, sharing the relatively low NINT climate sensitivity, but smaller ozone forcing 
with the TCADI ensembles. Warming is largest in the E2-H ensembles, where heat accumulates at the sur- 
face due to small export to the deep ocean (Figure 19). By the end of the twenty-first century, when forcing 
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by long-lived greenhouse gases dominates the total forcing, the E2-H TCADI ensemble is the warmest, due 
to its elevated climate sensitivity and slow heat uptake by the deep ocean, as will be reported elsewhere. 

The contrast between the TCADI and TCAD climates shows the effect of a mechanistic treatment of the AIE. 
Schmidt et al. [2014a] shows that cloud fraction errors are reduced in the TCADI ensembles downwind of 
regions of large aerosol emission like the North Pacific, the Labrador Sea, and Gulf of Guinea. Attribution of 
reduced errors to a mechanistic AIE treatment is challenging due to the complexity of cloud formation, but 
more comprehensive studies have found a similar reduction [Ekman, 2014]. The NH midlatitudes warm 
faster in the TCADI ensembles (Figure 9e), even though the aerosol forcing is more negative. This unantici- 
pated contrast of trends shows the limitation of simplified, empirical treatments of the AIE that cannot 
anticipate the behavior emerging from more mechanistic representations. 

All ensembles show positive annular trends corresponding to a decrease in Arctic and Antarctic surface 
pressure with a corresponding poleward shift of the midlatitude westerlies. Annular trends are apparent in 
the SH by the end of the twentieth century, associated with stratospheric ozone depletion that augments 
the effect of increasing greenhouse gas concentrations. The trend appears first in the TCAD and TCADI 
ensembles with prognostic ozone, whose loss slightly exceeds the observed value used to prescribe deple- 
tion in the NINT ensembles. In contrast, annular trends in the NH, where stratospheric ozone loss is negligi- 
ble, cannot be distinguished from unforced variability until the late twenty-first century. 

Particular aspects of model behavior deserve further investigation: for example, the increase in long-lived 
greenhouse gas forcing and fractional precipitation sensitivity compared to the previous generation 
(CMIP3) model. It is also of interest what aspect of climate change between the preindustrial and present 
day causes direct forcing by the same aerosol distribution to become substantially more negative (Table 5). 

We are preparing new TCADI simulations that correct deficiencies while incorporating updated forcing val- 
ues that were not available at the time of the original simulations. Preliminary analysis suggests that the pre- 
scribed forcing was biased upward, and that corrected values from observations will reduce the simulated 
warming trend toward the observed value [Schmidt et al., 2014b]. Given the remaining uncertainty associ- 
ated with aerosol forcing and deep ocean heat uptake, reproduction of the observed warming is not a dem- 
onstration that the model has the correct climate sensitivity. Previous generation (CMIP3) models simulated 
the twentieth-century warming despite a wide range of sensitivity [ Kiehl , 2007]. However, in the latest gener- 
ation of models, sensitivity and forcing are nearly uncorrelated [Forster et al., 201 3; Shindell et al., 201 3a], so 
reproduction of the observed climate is a more stringent test. Historical simulations remain an important 
opportunity for comparison to observed climate trends and processes that helps to assess projections of 
future climate change [Offo et al., 201 3; Sherwood et al., 201 4]. 
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